Bug 195083 - [NEW PORT] [textproc/uchardet] Universal charset detection library by Mozilla
Summary: [NEW PORT] [textproc/uchardet] Universal charset detection library by Mozilla
Status: Closed FIXED
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s) (show other bugs)
Version: Latest
Hardware: Any Any
: --- Affects Only Me
Assignee: Pawel Pekala
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-11-16 21:10 UTC by Yuri Victorovich
Modified: 2014-12-06 14:52 UTC (History)
1 user (show)

See Also:


Attachments
patch (2.22 KB, patch)
2014-11-16 21:10 UTC, Yuri Victorovich
no flags Details | Diff
poudriere log (58.54 KB, text/plain)
2014-11-16 21:11 UTC, Yuri Victorovich
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Yuri Victorovich freebsd_committer 2014-11-16 21:10:30 UTC
Created attachment 149493 [details]
patch

This is the library that is able to guess what encoding the text is in.

Despite the gradual demise of non-UTF-* encodings, this library is very useful for programs that need to guess the encoding of some legacy text data (like subtitles in video players like mplayer, etc).

This codebase was used for a long time within the Firefox browser (their "guess encoding" feature).
Comment 1 Yuri Victorovich freebsd_committer 2014-11-16 21:11:51 UTC
Created attachment 149494 [details]
poudriere log
Comment 2 Yuri Victorovich freebsd_committer 2014-11-17 20:47:24 UTC
The related (by functionality) project is converters/enca It detects mostly European charsets.
converters/ might be another category to consider, except textproc/uchardet doesn't convert anything, only outputs the detected charset type.
Comment 3 commit-hook freebsd_committer 2014-12-06 14:47:09 UTC
A commit references this bug:

Author: pawel
Date: Sat Dec  6 14:47:04 UTC 2014
New revision: 374113
URL: https://svnweb.freebsd.org/changeset/ports/374113

Log:
  uchardet is a C language binding of the original C++ implementation
  of the universal charset detection library by Mozilla.

  WWW: https://code.google.com/p/uchardet/

  PR:		195083
  Submitted by:	Yuri Victorovich <yuri@rawbw.com>

Changes:
  head/textproc/Makefile
  head/textproc/uchardet/
  head/textproc/uchardet/Makefile
  head/textproc/uchardet/distinfo
  head/textproc/uchardet/files/
  head/textproc/uchardet/files/patch-CMakeLists.txt
  head/textproc/uchardet/pkg-descr
  head/textproc/uchardet/pkg-plist
Comment 4 commit-hook freebsd_committer 2014-12-06 14:52:10 UTC
A commit references this bug:

Author: pawel
Date: Sat Dec  6 14:51:11 UTC 2014
New revision: 46066
URL: https://svnweb.freebsd.org/changeset/doc/46066

Log:
  For textproc/uchardet

  PR:		195083

Changes:
  head/en_US.ISO8859-1/articles/contributors/contrib.additional.xml