Difference between revisions of "Dictionary"

From CCIL
Jump to: navigation, search
(Created page with "=== Goal === The goal of this tutorial is to create a simple dictionary - a database with words from a specific language(s). It will be created in a very simple manner - we s...")
 
(Goal)
Line 1: Line 1:
=== Goal ===
+
== Goal ==
  
 
The goal of this tutorial is to create a simple dictionary - a database with words from a specific language(s). It will be created in a very simple manner - we supply some text to the pipeline (in PDF, TXT or any other popular format), which parses it and insert the words in a unique manner to a database.
 
The goal of this tutorial is to create a simple dictionary - a database with words from a specific language(s). It will be created in a very simple manner - we supply some text to the pipeline (in PDF, TXT or any other popular format), which parses it and insert the words in a unique manner to a database.
 +
 +
 +
What we have to do?
 +
# Parse text which comes in an arbitrary format
 +
# Insert all tokens words from it, which satisfy the 'word' criteria in a database with no duplications

Revision as of 03:47, 17 May 2017

Goal

The goal of this tutorial is to create a simple dictionary - a database with words from a specific language(s). It will be created in a very simple manner - we supply some text to the pipeline (in PDF, TXT or any other popular format), which parses it and insert the words in a unique manner to a database.


What we have to do?

  1. Parse text which comes in an arbitrary format
  2. Insert all tokens words from it, which satisfy the 'word' criteria in a database with no duplications