Difference between revisions of "Parse"
From CCIL
(Created page with "This component covers the ability of CCIL to read and basically understand various formats of inbound media.") |
(→API) |
||
(4 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | This component covers the ability of CCIL to read and | + | == About == |
+ | |||
+ | This component covers the ability of CCIL to read and understand at a a basic level various formats of inbound media. | ||
+ | The API is extendable at a maximal degree. So far the supported formats are: | ||
+ | * Plain Text | ||
+ | * PDF | ||
+ | |||
+ | == API == | ||
+ | The central point is the Parser interface. Its main purpose is to decompose an [https://docs.oracle.com/javase/7/docs/api/java/io/InputStream.html InputStream] to the features it contains. | ||
+ | |||
+ | The API implements the following ones: | ||
+ | {| | ||
+ | ! Name | ||
+ | ! Constant | ||
+ | ! Description | ||
+ | |- | ||
+ | | Title | ||
+ | | Parser.TITLE | ||
+ | | The title of the media. | ||
+ | |- | ||
+ | | Content | ||
+ | | Parser.CONTENT | ||
+ | | The textual body of the media. | ||
+ | |} | ||
+ | |||
+ | == Services == | ||
+ | * TikaParserService |
Latest revision as of 09:18, 14 May 2017
About
This component covers the ability of CCIL to read and understand at a a basic level various formats of inbound media. The API is extendable at a maximal degree. So far the supported formats are:
- Plain Text
API
The central point is the Parser interface. Its main purpose is to decompose an InputStream to the features it contains.
The API implements the following ones:
Name | Constant | Description |
---|---|---|
Title | Parser.TITLE | The title of the media. |
Content | Parser.CONTENT | The textual body of the media. |
Services
- TikaParserService