Character encoding detection tool for NodeJS
Go to file
Dmitry Shirokov ef30efcc27 mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
encoding mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
test mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
.gitignore mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
README.md mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
index.js mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00
match.js basic tests, improvements 2013-04-30 00:31:48 +10:00
package.json mooore tests, jp/cn/ko multibyte encodings 2013-04-30 23:49:02 +10:00

README.md

chardet - nodejs characted detection module, written in pure Javascript

Installation

npm install chardet

Usage

var chardet = require('chardet');
chardet.detect(new Buffer('hello there!'));
// or
chardet.detectFile('/path/to/file', function(err, encoding) {});
// or
chardet.detectFileSync('/path/to/file');

Supported Encodings:

  • UTF-8
  • UTF-16 LE
  • UTF-16 BE
  • UTF-32 LE
  • UTF-32 BE
  • ISO-2022-JP
  • ISO-2022-KR
  • ISO-2022-CN
  • Shift-JIS
  • Big5
  • EUC-JP
  • EUC-KR
  • GB18030

Currently only these encodings are supported, more will be added soon.