Subject:
Japanese on-line character databases in Unipen format
Date:
Mon, 4 Jun 2001 21:04:26 +0900
From:
Stefan Jaeger
To:
SCRIB-L@NIC.SURFNET.NL
Dear Colleagues,
we have collected two databases containing Japanese
on-line characters, Kuchibue and Nakayosi, and
transformed them into the plain ASCII Unipen format.
Kuchibue contains about 1.4 million characters
donated by 120 writers. Nakayosi comprises almost
1.7 million characters from 163 writers. The sets
of writers for Kuchibue and Nakayosi do not overlap.
Each writer of Kuchibue donated 11962 characters
covering 3356 Kanji categories. In Nakayosi, each
writer donated 10403 characters covering 4438
categories, which include more than 1000 special
Kanji characters for naming. Altogether, this sums
up to more than 3 million characters donated by
283 writers.
Kuchibue was collected by capturing mouse events
under a Microsoft Windows environment while writing
with pen on a LCD tablet. Nakayosi was collected by
capturing genuine tablet coordinates. For the Unipen
format, the originally SJIS coded labels of Kuchibue
and Nakayosi are listed with their hexadecimal
representations.
(...)
Best regards,
Stefan Jaeger
Here is the site:
TUAT Nakagawa Lab. HANDS-kuchibue_d-97-06-10
----------------------------------------------------------
Stefan Jaeger
Dept. of Computer, Information and Communication Sciences
Tokyo Univ. of Agri. & Tech., Nakagawa Laboratory
Naka-cho 2-24-16, Koganei, Tokyo, 184-8588, Japan
phone: +81-423-88-7273, fax: +81-423-87-4604
e-mail: stefan@hands.ei.tuat.ac.jp
http://www.tuat.ac.jp/~nakagawa/