Subject: 

         Japanese on-line character databases in Unipen format

    Date: 

         Mon, 4 Jun 2001 21:04:26 +0900

   From: 

         Stefan Jaeger 

     To: 

         SCRIB-L@NIC.SURFNET.NL



Dear Colleagues,



we have collected two databases containing Japanese

on-line characters, Kuchibue and Nakayosi, and

transformed them into the plain ASCII Unipen format.



Kuchibue contains about 1.4 million characters

donated by 120 writers. Nakayosi comprises almost

1.7 million characters from 163 writers. The sets

of writers for Kuchibue and Nakayosi do not overlap.

Each writer of Kuchibue donated 11962 characters

covering 3356 Kanji categories. In Nakayosi, each

writer donated 10403 characters covering 4438

categories, which include more than 1000 special

Kanji characters for naming. Altogether, this sums

up to more than 3 million characters donated by

283 writers.



Kuchibue was collected by capturing mouse events

under a Microsoft Windows environment while writing

with pen on a LCD tablet. Nakayosi was collected by

capturing genuine tablet coordinates. For the Unipen

format, the originally SJIS coded labels of Kuchibue

and Nakayosi are listed with their hexadecimal

representations.



(...)



Best regards,



Stefan Jaeger



Here is the site:

TUAT Nakagawa Lab. HANDS-kuchibue_d-97-06-10



----------------------------------------------------------

Stefan Jaeger

Dept. of Computer, Information and Communication Sciences

Tokyo Univ. of Agri. & Tech., Nakagawa Laboratory

Naka-cho 2-24-16, Koganei, Tokyo, 184-8588, Japan

phone: +81-423-88-7273, fax: +81-423-87-4604

e-mail: stefan@hands.ei.tuat.ac.jp

http://www.tuat.ac.jp/~nakagawa/