Detected encoding is wrong with DetectFromBytes, ok with other methods for UTF-8 file containing emoji

Test program launched from latest source:

```c#
            string filename = args[0];

            var result = CharsetDetector.DetectFromFile(filename);

            if (result.Detected != null)
            {
                Console.WriteLine("DetectFromFile - Charset: {0}, confidence: {1}", result.Detected.EncodingName, result.Detected.Confidence);
            }

            byte[] bytes = System.IO.File.ReadAllBytes(filename);
            result = CharsetDetector.DetectFromBytes(bytes);

            if (result.Detected != null)
            {
                Console.WriteLine("DetectFromBytes - Charset: {0}, confidence: {1}", result.Detected.EncodingName, result.Detected.Confidence);
            }

            System.IO.Stream fileStream = System.IO.File.OpenRead(filename);
            result = CharsetDetector.DetectFromStream(fileStream);

            if (result.Detected != null)
            {
                Console.WriteLine("DetectFromStream - Charset: {0}, confidence: {1}", result.Detected.EncodingName, result.Detected.Confidence);
            }

```

Result:
![image](https://user-images.githubusercontent.com/305637/29879908-b6486e48-8da6-11e7-874b-399c3b502350.png)


The file is a HTML UTF-8 (without BOM) encoded file containing 1 simple emoji : 😀
(attached in the zip below) 
[utf8_with_emoji.zip](https://github.com/CharsetDetector/UTF-unknown/files/1264089/utf8_with_emoji.zip)


Why does the `DetectFromBytes` method gives a different result?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Detected encoding is wrong with DetectFromBytes, ok with other methods for UTF-8 file containing emoji #38

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Detected encoding is wrong with DetectFromBytes, ok with other methods for UTF-8 file containing emoji #38

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions