The CJK Dictionary Institute
CJKI

ENG   


Dictionaries
   Resources
   Overview
   Japanese
   Chinese
   Korean
   Arabic
   How to Order
   Dictionaries


Websites
   Articles/papers
   What is CJKI?
   What is KDPS?
   Jack Halpern
   Links

 

Principal Japanese Lexical Resources

Our comprehensive Japanese lexical resources contain over three million entries covering general vocabulary, technical terms, proper nouns, company names, etc., used in such applications as machine translation (MT), information retrieval (IR) and input method editors (IME).

  1. News Flash: Japanese-English Geographical Database provides comprehensive coverage of cities and towns, streets and neighborhoods, roads, train stations, schools, points of interest, and government buildings.

  2. World's first: Japanese Phonological Database for speech technology with phonetic attributes such as accents and precise IPA allophonic variants. Names dictionary with accent data now available. Also see in-depth paper.

  3. News Flash: Japanese Personal Name Variants Database expanded to about four million name variants and hybrids covering seven romanization systems.

  4. News Flash: Database of Japanese Place Name Variants provides hundreds of thousands of variants for cities and towns, streets and neighborhoods, as well as train stations, schools, and government buildings.

  5. Major Expansion: Our Comprehensive Database of Technical Terminology, which exceeded one million terms several years ago, is now undergoing major expansion and improvements using state-of-the art computational lingistics techniques. This bidirectional Japanese-English dictionary database covers the major domains of science and technology. Also provided as domain-specific stand-alone modules, including:


  6. News Flash: Multilingual Database of Proper Nouns, covering Japanese, Simplified Chinese, Traditional Chinese, English and Korean, popular in electronic dictionaries.

  7. News Flash: Chinese-Japanese Database of Technical Terminology , covering 15 domains with special focus on computer/IT terminology, undergoes major expansion.

  8. Major Expansion: The CJKI Japanese-English Dictionary, an up-to-date dictionary covering 110,000 entries of general vocabulary and important proper nouns, has undergone major maintenance. The sister edition, The CJKI English-Japanese Dictionary, covers 82,000 entries of general vocabulary and important proper nouns.

  9. Just revised (2008): The CJKI English-Japanese Dictionary. An English-Japanese dictionary 82,000 entries covering general vocabulary and important proper names. This can be expanded to cover Western names and technical terms.

  10. Japanese Lexical Database. An database of about 300,000 entries covering general vocabulary with a rich set of grammatical attributes. Designed specifically for such applications as tokenization and information retrieval, this database is of critical importance for Japanese information processing.

  11. Comprehensive Database of Japanese Proper Nouns. A 1.4 million-entry Japanese-English dictionary of personal names (surnames and given names), place names and company names, semantically classified and ranked by frequency. Ideal for NLP applications such as IR, MT and NER.

  12. Japanese-English Dictionary of Western Names. A Japanese-English-Japanese dictionary of about 60,000 non-Japanese personal and place names with semantic classification codes and orthographic variants for personal many names.

  13. World's largest database of Japanese orthographic variants. Used by such major portals as Yahoo and Amazon, it is ideal for intelligent IR applications and advanced linguistic tools. Also see in-depth paper.

  14. Katakana Lexical Database. This comprehensive lexical database of katakana words, with various grammatical attributes, plays a vital role in enhancing NLP applications.

  15. IT and Computer Terminology. A comprehensive, constantly growing bilingual Japanese-English database of about 125,000 up-to-date computer terms. In addition, a Japanese-Chinese-English Multilingual Database of Computer Terms is also available.

  16. Named Entity Recognition A database of Japanese Named Entity Contextual Clues, keywords that precede or follow named entities, which play a critical role in enhancing the precision of entity recognition and extraction.

  17. Japanese Companies and organizations About 600,000 Japanese company and organization names ranked by frequency with English equivalents when appropriate.

  18. Japanese-English Neologisms. Our large Japanese-English database of about 30,000 Japanese neologisms.

  19. English-Japanese Neologisms. English-Japanese database, an up-to-date and accurate lexical database maintained by our team of Japanese editors.

  20. Show Business Celebrities. Japanese-English database of Japanese and international show business celebrities, including singers, actors, and entertainers ideal for mobile platform IMEs.

  21. Japanese Frequency Statistics. A comprehensive database of Japanese lexical statistics, such as frequency of occurrence of words and characters, based on large corpora.

  22. Morphological Attributes in Japanese. This document describes some of the morphological attributes in our Japanese lexical databases, such as derivational attributes and binding valency, which are particularly useful for disambiguating and identifying Japanese lexemes in such applications as input method editors (IME) and search engine query processing.

  23. Business and Finance Terms. A comprehensive Japanese-English database of up-to-date terms for business, finance and economics.

  24. Kanji-English Lexicon. A Kanji-English database that includes the comprehensive features of New Japanese-English Character Dictionary, which has become a standard reference work in Japanese education circles.

  25. Kanji Database. A single -character database that covers every aspect of CJK characters, including frequency, phonology, radicals, character codes, and other attributes.

 

 

CJKI Home