Bug 103571 - [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language
Summary: [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a documen...
Status: Closed FIXED
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s) (show other bugs)
Version: Latest
Hardware: Any Any
: Normal Affects Only Me
Assignee: Martin Wilke
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-09-24 18:10 UTC by markun
Modified: 2006-09-24 21:00 UTC (History)
1 user (show)

See Also:


Attachments
file.shar (3.21 KB, text/plain)
2006-09-24 18:10 UTC, markun
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description markun 2006-09-24 18:10:25 UTC
Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.

Author: Mike Schilli <cpan@perlmeister.com>
WWW:    http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
Comment 1 Martin Wilke freebsd_committer 2006-09-24 20:46:17 UTC
Responsible Changed
From-To: freebsd-ports-bugs->miwi

I'll take it.
Comment 2 dfilter service freebsd_committer 2006-09-24 20:52:05 UTC
miwi        2006-09-24 19:51:59 UTC

  FreeBSD ports repository

  Modified files:
    textproc             Makefile 
  Added files:
    textproc/p5-Text-Language-Guess Makefile distinfo pkg-descr 
                                    pkg-plist 
  Log:
  Text::Language::Guess guesses a document's language. Its implementation
  is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
  it determines how many of the known stopwords the document contains for
  each language supported by "Lingua::StopWords".
  
  Each word in the document recognized as stopword of a particular
  language scores one point for this language.
  
  The "language_guess()" function takes a document as a parameter and
  returns the abbreviation of the language that it is most likely written
  in.
  
  Author: Mike Schilli <cpan@perlmeister.com>
  WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
  
  PR:             ports/103571
  Submitted by:   Masahiro Teramoto <markun@onohara.to>
  
  Revision  Changes    Path
  1.1062    +1 -0      ports/textproc/Makefile
  1.1       +34 -0     ports/textproc/p5-Text-Language-Guess/Makefile (new)
  1.1       +3 -0      ports/textproc/p5-Text-Language-Guess/distinfo (new)
  1.1       +14 -0     ports/textproc/p5-Text-Language-Guess/pkg-descr (new)
  1.1       +9 -0      ports/textproc/p5-Text-Language-Guess/pkg-plist (new)
_______________________________________________
cvs-all@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/cvs-all
To unsubscribe, send any mail to "cvs-all-unsubscribe@freebsd.org"
Comment 3 Martin Wilke freebsd_committer 2006-09-24 20:53:08 UTC
State Changed
From-To: open->closed

New port added, with minor changes. Thanks!