Bug 103571

Summary: [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language
Product: Ports & Packages Reporter: markun
Component: Individual Port(s)Assignee: Martin Wilke <miwi>
Status: Closed FIXED    
Severity: Affects Only Me CC: markun
Priority: Normal    
Version: Latest   
Hardware: Any   
OS: Any   
Attachments:
Description Flags
file.shar none

Description markun 2006-09-24 18:10:25 UTC
Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.

Author: Mike Schilli <cpan@perlmeister.com>
WWW:    http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
Comment 1 Martin Wilke freebsd_committer freebsd_triage 2006-09-24 20:46:17 UTC
Responsible Changed
From-To: freebsd-ports-bugs->miwi

I'll take it.
Comment 2 dfilter service freebsd_committer freebsd_triage 2006-09-24 20:52:05 UTC
miwi        2006-09-24 19:51:59 UTC

  FreeBSD ports repository

  Modified files:
    textproc             Makefile 
  Added files:
    textproc/p5-Text-Language-Guess Makefile distinfo pkg-descr 
                                    pkg-plist 
  Log:
  Text::Language::Guess guesses a document's language. Its implementation
  is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
  it determines how many of the known stopwords the document contains for
  each language supported by "Lingua::StopWords".
  
  Each word in the document recognized as stopword of a particular
  language scores one point for this language.
  
  The "language_guess()" function takes a document as a parameter and
  returns the abbreviation of the language that it is most likely written
  in.
  
  Author: Mike Schilli <cpan@perlmeister.com>
  WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
  
  PR:             ports/103571
  Submitted by:   Masahiro Teramoto <markun@onohara.to>
  
  Revision  Changes    Path
  1.1062    +1 -0      ports/textproc/Makefile
  1.1       +34 -0     ports/textproc/p5-Text-Language-Guess/Makefile (new)
  1.1       +3 -0      ports/textproc/p5-Text-Language-Guess/distinfo (new)
  1.1       +14 -0     ports/textproc/p5-Text-Language-Guess/pkg-descr (new)
  1.1       +9 -0      ports/textproc/p5-Text-Language-Guess/pkg-plist (new)
_______________________________________________
cvs-all@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/cvs-all
To unsubscribe, send any mail to "cvs-all-unsubscribe@freebsd.org"
Comment 3 Martin Wilke freebsd_committer freebsd_triage 2006-09-24 20:53:08 UTC
State Changed
From-To: open->closed

New port added, with minor changes. Thanks!