author | ThibG <thib@sitedethib.com> | 2018-12-17 19:19:45 +0100 |
---|---|---|
committer | Eugen Rochko <eugen@zeonfederated.com> | 2018-12-17 19:19:45 +0100 |
commit | e709b8da0d685d3cc48d430a9761896094f67d72 (patch) | |
tree | 8d60d548237c7478bb974e2813288fdda11a8862 /spec/fixtures | |
parent | 4ede51743e5b9121a49e9131f91cf012fab410f8 (diff) |
Ignore low-confidence CharlockHolmes guesses when parsing link cards (#9510)
* Add failing test for windows-1251 link cards
* Ignore low-confidence CharlockHolmes guesses
Fixes #9466
* Fix no method error when charlock holmes cannot detect charset
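The behaviour described in the commit message can be sketched roughly as below. This is a minimal illustration, not the code from the commit: the helper names, the `MIN_CONFIDENCE` value, and the fallback order are assumptions. The `guess` argument stands in for the hash that CharlockHolmes' detector returns (with `:encoding` and `:confidence` keys), or `nil` when detection fails.

```ruby
# Hypothetical threshold; this view of the commit does not show the real value.
MIN_CONFIDENCE = 60

# Returns the guessed encoding name only when the detector was confident
# enough. A nil guess (detector found nothing) is handled explicitly so no
# NoMethodError is raised.
def encoding_from_guess(guess, min_confidence: MIN_CONFIDENCE)
  return nil if guess.nil?
  return nil if guess[:confidence].to_i < min_confidence

  guess[:encoding]
end

# Decodes raw bytes to UTF-8, preferring a confident guess, then the charset
# the page declares (an assumed fallback), then UTF-8.
def decode_body(raw, guess, declared_charset: nil)
  charset = encoding_from_guess(guess) || declared_charset || 'UTF-8'
  raw.dup.force_encoding(charset).encode('UTF-8', invalid: :replace, undef: :replace)
end
```

With a low-confidence guess such as `{ encoding: 'ISO-8859-1', confidence: 20 }`, `decode_body` ignores the guess and falls back to the declared `windows-1251` charset, which is the situation the new fixture exercises.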
Diffstat (limited to 'spec/fixtures')
-rw-r--r-- | spec/fixtures/requests/windows-1251.txt | 17 |
1 files changed, 17 insertions, 0 deletions
diff --git a/spec/fixtures/requests/windows-1251.txt b/spec/fixtures/requests/windows-1251.txt
new file mode 100644
index 00000000000..f573e28b249
--- /dev/null
+++ b/spec/fixtures/requests/windows-1251.txt
@@ -0,0 +1,17 @@
+HTTP/1.1 200 OK
+server: nginx
+date: Wed, 12 Dec 2018 13:14:03 GMT
+content-type: text/html
+content-length: 190
+accept-ranges: bytes
+
+<!DOCTYPE html>
+<html>
+<head>
+ <meta http-equiv="Content-Type" content="text/html; charset=windows-1251" />
+ <title>сэмпл текст</title>
+</head>
+<body>
+ <p>сэмпл текст</p>
+</body>
+</html>