Download List

프로젝트 설명

The Okapi project’s main purpose is to architect a set of building blocks for the creation of larger open source localization and translation tools. But many Okapi components are generic enough to be of interest to the text mining, natural language processing, and text retrieval communities. Okapi’s many text filters (HTML, Properties, XML (ITS XPath-based rules), OpenXML, ODF, Regex etc.) provide a straightforward way to access the text of multiple document formats. Its document events and pipeline can be made to integrate with other frameworks such as UIMA, LingPipe, OpenPipeline, OpenNLP, GATE, and Lucene. The advantage of Okapi’s text filters is that not only is text extracted, but all non-textual formatting is preserved. It is possible to decompose a document into events, process them via the pipeline, and then rebuild the input document without loss. Structural information can be added to Okapi document events so that tables, lists, links, titles etc. are grouped together and treated as a unit. This is useful when context based on a “universal” document structure is needed. The Okapi event model supports user configurable annotations, similar to UIMA, but simpler and more restricted in scope. User can annotate spans of text or add new resources such as translation memory matches, terminology, token types, or part of speech information.

System Requirements

System requirement is not defined
Information regarding Project Releases and Project Resources. Note that the information here is a quote from Freecode.com page, and the downloads themselves may not be hosted on OSDN.

2009-08-31 23:14
0.3

Okapi Framework의 두 가지 패키지 : okapi -로 배포됩니다 lib 디렉토리, 모든 라이브러리는 아니지만 UI는 부분, 그리고 okapi - 애플 리케이션, 이는 UI가 부품과 무지개와 오소리의 일종과 같은 몇 가지 애플 리케이션과 같은 파일을 포함 포함 개발자를위한. Tikal 단순 현지화 수행하는 새로운 도구를 추출과 같은 작업과 관련된 다른 파일 형식의 XLIFF에 병합합니다. 새로운 Qt는 TS 필터가 추가되었습니다. 번역 Toolkit은 TM을위한 새로운 커넥터가 추가되었습니다.
The Okapi Framework is now distributed as two packages: okapi-lib, for developers, which includes all the libraries but not UI parts, and okapi-apps, which includes the same files with the UI parts and several applications such as Rainbow and Ratel. Tikal is a new tool to perform simple localization-related tasks like extraction and merging of different file formats to XLIFF. A new Qt TS filter has been added. A new connector for Translate Toolkit TM has been added.

2009-05-02 02:26
Milestone 1

첫 번째 이정표를, 및 다른 포맷뿐만 아니라, 레인보우, 관련 작업, 그리고 오소리의 일종, 세분화 규칙 편집기 다양한 현지화 수행하기위한 도구 상자에서 XML을 HTML과, 포, 속성, XLIFF에 대한 필터 설정이 포함되어있습니다.
The first milestone includes a set of filters for XML, HTML, PO, Properties, XLIFF, and other formats as well as Rainbow, a toolbox for performing various localization-related tasks, and Ratel, an editor for segmentation rules.

Project Resources