# -*- rd -*-
ChupaText has been re-implemented in Ruby. This code is no longer
maintained. See https://github.com/ranguba/chupa-text/ for the new
implementation.
= README --- An introduction to ChupaText, a text extraction utility
== Name
ChupaText
== Author
* Nobuyoshi Nakada <nakada@clear-code.com>
* Kouhei Sutou <kou@clear-code.com>
== License
  * Source: LGPLv2.1 or later. (detail:
    ((<"license/lgpl-2.1.txt"|URL:http://www.gnu.org/licenses/lgpl-2.1.html>)))
  * Document: Triple license: LGPL, GFDL and/or CC.
    * LGPL: v2.1 or later. (detail:
      ((<"license/lgpl-2.1.txt"|URL:http://www.gnu.org/licenses/lgpl-2.1.html>)))
    * GFDL: v1.3 or later. (detail:
      ((<"license/gfdl-1.3.txt"|URL:http://www.gnu.org/licenses/fdl.html>)))
    * CC: ((<BY-SA|URL:http://creativecommons.org/licenses/by-sa/3.0/>))
  * Exceptions:
    * modules/excel/: GPLv2. (detail:
      ((<"license/gpl-2.txt"|URL:http://www.gnu.org/licenses/gpl-2.html>)))
      They are included in ((<Gnumeric|URL:http://projects.gnome.org/gnumeric/>)).
    * ...
== What's this?
ChupaText is a text extraction utility. It can extract text
and metadata from PDF and office documents. You can use it
as a library, from the command line, or as a Web service.
== Dependency libraries and software
Required:
* GLib >= 2.24
* libgsf
Optional:
* Poppler
* wv
* libgoffice
* Gnumeric
* LibreOffice, OpenOffice.org or unoconv
* Ruby >= 1.9.2
== Get
tar.gz: ((<URL:http://rubyforge.org/frs/?group_id=8073>))
== Repository
The ChupaText repository is hosted on
((<GitHub|URL:http://github.com/ranguba/chupatext>)).
  % git clone git://github.com/ranguba/chupatext.git
== Install
See ((<install>)).
== Usage
  % chupatext [OPTION ...] FILE ...
FILE is a file that you want to extract text from.
See ((<chupatext|"doc/chupatext.rd">)) for more details.
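The command line above can also be driven from a script. The following is a minimal sketch, not part of ChupaText itself: the helper names build_command and extract are hypothetical, and it assumes that the chupatext command is installed on PATH and writes the extracted text to standard output, which this README does not confirm. No options are passed because none are listed here.

```python
import shutil
import subprocess


def build_command(files, options=()):
    """Build the argument list for: chupatext [OPTION ...] FILE ..."""
    files = list(files)
    if not files:
        raise ValueError("at least one FILE is required")
    # Options come first, then the files, as in the usage line.
    return ["chupatext", *options, *files]


def extract(files, options=()):
    """Run chupatext and return whatever it prints to standard output.

    Assumption: the extracted text is written to stdout.
    """
    if shutil.which("chupatext") is None:
        raise FileNotFoundError("chupatext was not found on PATH")
    result = subprocess.run(
        build_command(files, options),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )
    return result.stdout


if __name__ == "__main__":
    # "sample.pdf" is a placeholder file name.
    print(build_command(["sample.pdf"]))
```

Keeping the argument construction separate from the process call makes it possible to check the command line without having ChupaText installed.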
== Thanks
* Yuto Hayamizu