Difference between revisions of "Parse"

From CCIL
Jump to: navigation, search
(API)
(API)
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
== About ==
 
== About ==
  
This component covers the ability of CCIL to read and basically understand various formats of inbound media. The API is extendable, so far the supported formats are:
+
This component covers the ability of CCIL to read and understand at a a basic level various formats of inbound media.
 +
The API is extendable at a maximal degree. So far the supported formats are:
 
* Plain Text
 
* Plain Text
 
* PDF
 
* PDF
Line 10: Line 11:
 
The API implements the following ones:
 
The API implements the following ones:
 
{|
 
{|
  |+ Name
+
  ! Name
  |+ Constant
+
  ! Constant
  |+ Description
+
  ! Description
 
  |-
 
  |-
 
  | Title
 
  | Title
Line 22: Line 23:
 
  | The textual body of the media.
 
  | The textual body of the media.
 
  |}
 
  |}
 +
 +
== Services ==
 +
* TikaParserService

Latest revision as of 09:18, 14 May 2017

About

This component covers the ability of CCIL to read and understand at a a basic level various formats of inbound media. The API is extendable at a maximal degree. So far the supported formats are:

  • Plain Text
  • PDF

API

The central point is the Parser interface. Its main purpose is to decompose an InputStream to the features it contains.

The API implements the following ones:

Name Constant Description
Title Parser.TITLE The title of the media.
Content Parser.CONTENT The textual body of the media.

Services

  • TikaParserService