Difference between revisions of "Parse"

Latest revision as of 09:18, 14 May 2017

About

This component covers the ability of CCIL to read inbound media in various formats and to understand them at a basic level. The API is designed to be highly extensible. The formats supported so far are:

  • Plain Text
  • PDF

API

The central point of the API is the Parser interface. Its main purpose is to decompose an InputStream (https://docs.oracle.com/javase/7/docs/api/java/io/InputStream.html) into the features it contains.

The API currently defines the following features:

Name     Constant        Description
Title    Parser.TITLE    The title of the media.
Content  Parser.CONTENT  The textual body of the media.
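
As an illustration of how an InputStream might be decomposed into these features, here is a minimal Java sketch. It is not the actual CCIL code: only the names Parser, Parser.TITLE and Parser.CONTENT come from this page. The values of the constants, the parse method and its Map return type, and the PlainTextParser and ParserSketch classes are all assumptions made for the example. The sketch also assumes that the title of a plain-text document is its first non-blank line and that the content is the full text.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical shape of the Parser interface. Only the constant names
// TITLE and CONTENT appear on this page; their values and the parse
// method signature are assumptions for illustration.
interface Parser {
    String TITLE = "title";     // assumed value
    String CONTENT = "content"; // assumed value

    // Decomposes the stream into a map from feature constant to value.
    // In this sketch, read failures surface as UncheckedIOException.
    Map<String, String> parse(InputStream in);
}

// Hypothetical plain-text implementation: the first non-blank line is
// treated as the title, and the whole text is treated as the content.
final class PlainTextParser implements Parser {
    @Override
    public Map<String, String> parse(InputStream in) {
        // The caller owns the stream, so it is not closed here.
        List<String> lines =
            new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))
                .lines()
                .collect(Collectors.toList());

        String title = "";
        for (String line : lines) {
            if (!line.trim().isEmpty()) {
                title = line.trim();
                break;
            }
        }

        Map<String, String> features = new LinkedHashMap<>();
        features.put(Parser.TITLE, title);
        features.put(Parser.CONTENT, String.join("\n", lines));
        return features;
    }
}

final class ParserSketch {
    public static void main(String[] args) {
        byte[] media = "Quarterly Report\nRevenue grew.".getBytes(StandardCharsets.UTF_8);
        Map<String, String> features = new PlainTextParser().parse(new ByteArrayInputStream(media));
        System.out.println(features.get(Parser.TITLE));
        System.out.println(features.get(Parser.CONTENT));
    }
}
```

Returning a map keyed by the feature constants would let new features be added without changing the method signature, which fits the stated goal of extensibility. The real interface may expose features differently.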

Services

  • TikaParserService