Results 11 to 20 of about 4,716 (312)
The Balinese Unicode Text Processing
In principal, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers although not all characters are covered.
Imam Habibi, Rinaldi Munir
doaj +3 more sources
Unicode at Gigabytes per Second [PDF]
We often represent text using Unicode formats (UTF-8 and UTF-16). The UTF-8 format is increasingly popular, especially on the web (XML, HTML, JSON, Rust, Go, Swift, Ruby). The UTF-16 format is most common in Java, .NET, and inside operating systems such as Windows.
Daniel Lemire, Lemire, Daniel
openaire +5 more sources
briandfoy/unicode-unihan: Unihan-0.043
Revision history for Perl extension Unicode::Unihan. 0.043 2022-03-08T01:18:31Z * Makefile.PL is more careful about the DBM files it creates. This responds to GitHub #2, and RT-125349 and RT-75802.
brian d foy
core +1 more source
EXPONENTIAL IMPROVEMENT IN PRECISION FOR SIMULATING SPARSE HAMILTONIANS
We provide a quantum algorithm for simulating the dynamics of sparse Hamiltonians with complexity sublogarithmic in the inverse error, an exponential improvement over previous methods. Specifically, we show that a
DOMINIC W. BERRY +4 more
doaj +1 more source
Let $G$ be a finite group acting transitively on a set $\unicode[STIX]{x1D6FA}$
NICK GILL
doaj +1 more source
COVER TIME FOR THE FROG MODEL ON TREES
The frog model is a branching random walk on a graph in which particles branch only at unvisited sites. Consider an initial particle density of $\unicode[STIX]{x1D707}$ on the full $d$-ary tree of height $n$.
CHRISTOPHER HOFFMAN +2 more
doaj +1 more source
HIGHER RANDOMNESS AND GENERICITY
We use concepts of continuous higher randomness, developed in Bienvenu et al. [‘Continuous higher randomness’, J. Math. Log. 17(1) (2017).], to investigate $\unicode[STIX]{x1D6F1}_{1}^{1}$
NOAM GREENBERG, BENOIT MONIN
doaj +1 more source
A Variant Character Dataset for Historical Narratives of Middle and Late Imperial China
Due to a tradition of valuing historical records, pre-modern China developed a wide range of historical narratives, not limited to the Official History. Computational analysis of Chinese historical narratives is hampered by the historical, regional, and ...
Jiwon Lee, Youngim Jung
doaj +1 more source
GEOMETRIC BIJECTIONS FOR REGULAR MATROIDS, ZONOTOPES, AND EHRHART THEORY
Let $M$ be a regular matroid. The Jacobian group $\text{Jac}(M)$ of $M$ is a finite abelian group whose cardinality is equal to the number of bases of $M$.
SPENCER BACKMAN +2 more
doaj +1 more source
A unicode based adaptive segmentor [PDF]
This paper presents a Unicode based Chinese word segmentor. It can handle Chinese text in Simplified, Traditional, or mixed mode. The system uses the strategy of divide-and-conquer to handle the recognition of personal names, numbers, time and numerical values, etc in the preprocessing stage.
Q. Lu +5 more
openaire +1 more source

