Blame view

node_modules/chardet/README.md 1.31 KB
ce4c83ff   wxy   初始提交
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
  
  chardet [![Build Status](https://travis-ci.org/runk/node-chardet.png)](https://travis-ci.org/runk/node-chardet)
  =====
  
  Chardet is a character detection module for NodeJS written in pure Javascript.
  Module is based on ICU project http://site.icu-project.org/, which uses character
  occurency analysis to determine the most probable encoding.
  
  ## Installation
  
  ```
  npm i chardet
  ```
  
  ## Usage
  
  ```javascript
  var chardet = require('chardet');
  chardet.detect(new Buffer('hello there!'));
  // or
  chardet.detectFile('/path/to/file', function(err, encoding) {});
  // or
  chardet.detectFileSync('/path/to/file');
  ```
  
  ## Working with large data sets
  
  Sometimes, when data set is huge and you want to optimize performace (in tradeoff of less accuracy), 
  you can sample only first N bytes of the buffer:
  
  ```javascript
  chardet.detectFile('/path/to/file', { sampleSize: 32 }, function(err, encoding) {});
  ```
  
  ## Supported Encodings:
  
  * UTF-8
  * UTF-16 LE
  * UTF-16 BE
  * UTF-32 LE
  * UTF-32 BE
  * ISO-2022-JP
  * ISO-2022-KR
  * ISO-2022-CN
  * Shift-JIS
  * Big5
  * EUC-JP
  * EUC-KR
  * GB18030
  * ISO-8859-1
  * ISO-8859-2
  * ISO-8859-5
  * ISO-8859-6
  * ISO-8859-7
  * ISO-8859-8
  * ISO-8859-9
  * windows-1250
  * windows-1251
  * windows-1252
  * windows-1253
  * windows-1254
  * windows-1255
  * windows-1256
  * KOI8-R
  
  Currently only these encodings are supported, more will be added soon.