Bug 181992 - New port: textproc/py-pdfminer PDF parser and analyzer
Summary: New port: textproc/py-pdfminer PDF parser and analyzer
Status: Closed FIXED
Alias: None
Product: Ports & Packages
Classification: Unclassified
Component: Individual Port(s) (show other bugs)
Version: Latest
Hardware: Any Any
: Normal Affects Only Me
Assignee: Po-Chuan Hsieh
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-09-10 16:00 UTC by Serhii (Sergey) Kozlov
Modified: 2013-11-29 20:10 UTC (History)
0 users

See Also:


Attachments
file.shar (5.01 KB, text/plain)
2013-09-10 16:00 UTC, Serhii (Sergey) Kozlov
no flags Details

Note You need to log in before you can comment on or make changes to this bug.
Description Serhii (Sergey) Kozlov freebsd_committer freebsd_triage 2013-09-10 16:00:00 UTC
New port: textproc/py-pdfminer

PDFMiner is a tool for extracting information from PDF documents. Unlike other
PDF-related tools, it focuses entirely on getting and analyzing text data.
PDFMiner allows to obtain the exact location of texts in a page, as well as
other information such as fonts or lines. It includes a PDF converter that can
transform PDF files into other text formats (such as HTML).
It has an extensible PDF parser that can be used for other purposes instead
of text analysis.

Tested with FreeBSD Port Tools 0.99_8, looks fine.

Fix: Patch attached with submission follows:
Comment 1 Edwin Groothuis freebsd_committer freebsd_triage 2013-09-10 16:00:08 UTC
Responsible Changed
From-To: freebsd-ports-bugs->freebsd-python

freebsd-python@ wants this port PRs (via the GNATS Auto Assign Tool)
Comment 2 Po-Chuan Hsieh freebsd_committer freebsd_triage 2013-11-29 18:55:44 UTC
Responsible Changed
From-To: freebsd-python->sunpoet

I'll take it.
Comment 3 dfilter service freebsd_committer freebsd_triage 2013-11-29 20:02:19 UTC
Author: sunpoet
Date: Fri Nov 29 20:02:05 2013
New Revision: 335218
URL: http://svnweb.freebsd.org/changeset/ports/335218

Log:
  - Add py-pdfminer
  - While I'm here, update to 20131113
  
  PDFMiner is a tool for extracting information from PDF documents. Unlike other
  PDF-related tools, it focuses entirely on getting and analyzing text data.
  PDFMiner allows to obtain the exact location of texts in a page, as well as
  other information such as fonts or lines. It includes a PDF converter that can
  transform PDF files into other text formats (such as HTML).
  It has an extensible PDF parser that can be used for other purposes instead
  of text analysis.
  
  WWW: http://www.unixuser.org/~euske/python/pdfminer/index.html
  
  PR:		ports/181992
  Submitted by:	Kozlov Sergey <kozlov.sergey.404@gmail.com>

Added:
  head/textproc/py-pdfminer/
  head/textproc/py-pdfminer/Makefile   (contents, props changed)
  head/textproc/py-pdfminer/distinfo   (contents, props changed)
  head/textproc/py-pdfminer/pkg-descr   (contents, props changed)
  head/textproc/py-pdfminer/pkg-plist   (contents, props changed)
Modified:
  head/textproc/Makefile

Modified: head/textproc/Makefile
==============================================================================
--- head/textproc/Makefile	Fri Nov 29 19:56:51 2013	(r335217)
+++ head/textproc/Makefile	Fri Nov 29 20:02:05 2013	(r335218)
@@ -1175,6 +1175,7 @@
     SUBDIR += py-markdown2
     SUBDIR += py-openpyxl
     SUBDIR += py-paragrep
+    SUBDIR += py-pdfminer
     SUBDIR += py-pss
     SUBDIR += py-pyctpp2
     SUBDIR += py-pyelasticsearch

Added: head/textproc/py-pdfminer/Makefile
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/textproc/py-pdfminer/Makefile	Fri Nov 29 20:02:05 2013	(r335218)
@@ -0,0 +1,18 @@
+# Created by: Kozlov Sergey <kozlov.sergey.404@gmail.com>
+# $FreeBSD$
+
+PORTNAME=	pdfminer
+PORTVERSION=	20131113
+CATEGORIES=	textproc python
+MASTER_SITES=	CHEESESHOP
+PKGNAMEPREFIX=	${PYTHON_PKGNAMEPREFIX}
+
+MAINTAINER=	kozlov.sergey.404@gmail.com
+COMMENT=	PDF parser and analyzer
+
+LICENSE=	MIT
+
+USE_PYTHON=	-2.7
+USE_PYDISTUTILS=yes
+
+.include <bsd.port.mk>

Added: head/textproc/py-pdfminer/distinfo
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/textproc/py-pdfminer/distinfo	Fri Nov 29 20:02:05 2013	(r335218)
@@ -0,0 +1,2 @@
+SHA256 (pdfminer-20131113.tar.gz) = 1016246265f9d48645229164ef44483e9b9a1b3e6902783782f9ae4e1024ea19
+SIZE (pdfminer-20131113.tar.gz) = 4080763

Added: head/textproc/py-pdfminer/pkg-descr
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/textproc/py-pdfminer/pkg-descr	Fri Nov 29 20:02:05 2013	(r335218)
@@ -0,0 +1,9 @@
+PDFMiner is a tool for extracting information from PDF documents. Unlike other
+PDF-related tools, it focuses entirely on getting and analyzing text data.
+PDFMiner allows to obtain the exact location of texts in a page, as well as
+other information such as fonts or lines. It includes a PDF converter that can
+transform PDF files into other text formats (such as HTML).
+It has an extensible PDF parser that can be used for other purposes instead
+of text analysis.
+
+WWW: http://www.unixuser.org/~euske/python/pdfminer/index.html

Added: head/textproc/py-pdfminer/pkg-plist
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/textproc/py-pdfminer/pkg-plist	Fri Nov 29 20:02:05 2013	(r335218)
@@ -0,0 +1,79 @@
+bin/dumppdf.py
+bin/latin2ascii.py
+bin/pdf2txt.py
+%%PYTHON_SITELIBDIR%%/pdfminer/__init__.py
+%%PYTHON_SITELIBDIR%%/pdfminer/__init__.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/__init__.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/arcfour.py
+%%PYTHON_SITELIBDIR%%/pdfminer/arcfour.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/arcfour.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/ascii85.py
+%%PYTHON_SITELIBDIR%%/pdfminer/ascii85.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/ascii85.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/ccitt.py
+%%PYTHON_SITELIBDIR%%/pdfminer/ccitt.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/ccitt.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/cmapdb.py
+%%PYTHON_SITELIBDIR%%/pdfminer/cmapdb.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/cmapdb.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/converter.py
+%%PYTHON_SITELIBDIR%%/pdfminer/converter.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/converter.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/encodingdb.py
+%%PYTHON_SITELIBDIR%%/pdfminer/encodingdb.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/encodingdb.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/fontmetrics.py
+%%PYTHON_SITELIBDIR%%/pdfminer/fontmetrics.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/fontmetrics.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/glyphlist.py
+%%PYTHON_SITELIBDIR%%/pdfminer/glyphlist.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/glyphlist.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/image.py
+%%PYTHON_SITELIBDIR%%/pdfminer/image.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/image.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/latin_enc.py
+%%PYTHON_SITELIBDIR%%/pdfminer/latin_enc.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/latin_enc.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/layout.py
+%%PYTHON_SITELIBDIR%%/pdfminer/layout.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/layout.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/lzw.py
+%%PYTHON_SITELIBDIR%%/pdfminer/lzw.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/lzw.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfcolor.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfcolor.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfcolor.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdevice.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdevice.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdevice.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdocument.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdocument.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfdocument.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdffont.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdffont.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdffont.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfinterp.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfinterp.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfinterp.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfpage.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfpage.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfpage.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfparser.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfparser.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdfparser.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/pdftypes.py
+%%PYTHON_SITELIBDIR%%/pdfminer/pdftypes.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/pdftypes.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/psparser.py
+%%PYTHON_SITELIBDIR%%/pdfminer/psparser.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/psparser.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/rijndael.py
+%%PYTHON_SITELIBDIR%%/pdfminer/rijndael.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/rijndael.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/runlength.py
+%%PYTHON_SITELIBDIR%%/pdfminer/runlength.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/runlength.pyo
+%%PYTHON_SITELIBDIR%%/pdfminer/utils.py
+%%PYTHON_SITELIBDIR%%/pdfminer/utils.pyc
+%%PYTHON_SITELIBDIR%%/pdfminer/utils.pyo
+@dirrmtry %%PYTHON_SITELIBDIR%%/pdfminer
_______________________________________________
svn-ports-all@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/svn-ports-all
To unsubscribe, send any mail to "svn-ports-all-unsubscribe@freebsd.org"
Comment 4 Po-Chuan Hsieh freebsd_committer freebsd_triage 2013-11-29 20:02:24 UTC
State Changed
From-To: open->closed

New port added. Thanks!