
Y. XU ET AL. 185
Algorithm for real: Through the gradual increasing or
reducing the length of a code string such as X, search the
code string in the standard code string set which has the
minimum distance with the unknown code string as the
results, and transform the stroke matching problem into
solving the best matching problem between two symbol
strings.
There are a lot of attached strokes through the free
writing, and strokes string to be identified is not known
in advance, then we need to take stroke splitting, and
each splitting produces a series of strokes to be knowl-
edge, so this method has multi-step iterative and volume
computation. We recognize that: 1) Long stroke is more
steady than short stroke; 2) Different types of strokes
have different possibility of confusion. Focusing on these
two differences, this paper has proposed to take “fuzzy
numbers of length” and “fuzzy numbers of direction”
treatment strategy. See the following section.
4. Combining Trial
Because there are a large number of connected strokes
that is multiple strokes connected with unconstrained
handwritten Chinese characters (belong to the same
natural strokes), and we can not ask the writer to separate
a painting to each stroke, stroke recognition includes two
processes which are stroke splitting and stroke judgment,
requiring to make a split decision from the entire word
which comes from an unknown mode and then make a
decision. These two processes complement each other,
alternately. So, we propose the combining trial to recog-
nize stroke.
The recognition process of combining trial is a tenta-
tive process of splitting and judgment constantly:
Split→Judge→Split again→Judge again …
Input: the direction strokes code sequence of a natural
stroke (string);
Output: the stroke code sequence of the natural stroke
(string);
Rule number one: Principle of giving priority to take
large;
Rule number two: Folded strokes determine the seg-
ments.
4.1. Strategy of Strokes Splitting-Increasing Test
For strokes codes sequence of a natural stroke (which
may contain multiple strokes), from the first strokes
codes, take a strokes code each time increasing to form a
test stroke code, take the dictionary matching and save
the matching results temporarily; then take the next
strokes code to form a new test stroke code for matching
operations, and so on. If you have extracted a predeter-
mined one of a few more strokes off, that is, the extrac-
tion of strokes contained in all segments is completed,
the stroke has been identified. If the strokes code of
natural stroke code does not end, then clear the data
structures which placed test stroke code (delete the test
stroke code),then take the next strokes code to re-formed
a test stroke code, and to determine the next stroke (the
natural stroke with multiple strokes), Until all of the
strokes code of natural stroke have been taken into the
match, then take the stroke which contains the largest
number of strokes as the match result and return it
(whichever is greater priority), to give priority to extract
folded stroke.
4.2. Strategy of Stroke Judgment-Similar
Matching
Precise matching technology requires that signature code
string to be identified must be equal to the stored signa-
ture code string in the feature dictionary. So, the refer-
ence template of the pattern in the dictionary must have
equivalent coverage, i.e., comprehensive, can cover the
most common deformation of the pattern. Obviously, the
high matching accuracy and the fast determination speed
are obvious advantages of the technology, and recogni-
tion performance mainly depends on the completeness of
the reference template. Because mode deformation range
can not be limited, it may have rejection (the code is not
in the library). In view of this, the establishment of a
more complete feature library is an important task [3,4].
Similarity matching, it is raised by the unpredictable
issues of deformation, it is usually take “distance” or
“similar degree” of model to be knowledge and reference
model as model criterion, and the definition of these cri-
teria varied. The system uses the strategy of accurate
identification first and then similar identification, that is,
when exact match Produces rejection, it is transferred to
produce similar matching module, to add similar identi-
fication of strokes of rejection. The string similarity,
matching techniques are varied, more typical of them are
dynamic programming method, fuzzy property law, the
error correction method and various weighted matching
method. When the stroke code uses non-equal length
strokes code string, in order to reflect these differences of
strokes in length and in type, this paper proposes fuzzy
numbers of length and fuzzy numbers of direction to deal
with them.
Take several written of folded stroke “乙”as an exam-
ple, Figure 2(a) as the standard wording, standard stroke
code is Gaca
and others are stroke variant. The
strokes code set
,,,abcd is the 4 yards direction code
of the strokes. These graphics are similar but clearly
different. The difference is that the String to be identified
has produced a distortion, and we called those strokes
code that out of standard strokes or produced a distortion
as deformed strokes. If you take exact matching and
Copyright © 2012 SciRes. OJAppS