
The Web for Corpus and the Web as Corpus in Translator Training

2013
New Voices in Translation Studies, 10, 1, 54 ...
Buendía-Castro, Miriam   +1 more

The World Wide Web as Linguistic Corpus

2003
Increasingly, corpus linguists have begun using the World Wide Web as a corpus for conducting linguistic analyses. The Web, however, is really a very different kind of corpus: we do not know, for instance, precisely how large it is or what kinds of texts are on it.
Charles F. Meyer   +4 more

Using the Web as corpus for self-training text categorization

Information Retrieval, 2008
Most current methods for automatic text categorization are based on supervised learning techniques and, therefore, they face the problem of requiring a great number of training instances to construct an accurate classifier. In order to tackle this problem, this paper proposes a new semi-supervised method for text categorization, which considers the ...
Rafael Guzmán-Cabrera   +3 more

The academic Web-as-Corpus

2013
As a result of the European Union’s pressure towards internationalization, universities in many countries find themselves increasingly urged to provide information on their requirements and services and to promote themselves in English on the web. Hence the need for corpus resources and studies of institutional academic English used as an international ...
Adriano Ferraresi, Silvia Bernardini

The Web as Corpus and Authorship Attribution

2014
In order to understand the potential limitations and problems with using the web as a corpus during forensic investigations, key issues covered in this chapter include a description of the web and how it is searched using commercial search engines. The reliability of search engines is then discussed with evidence suggesting that whilst search engine ...

The Web as Corpus: Theory and Practice. Maristella Gatto.

Digital Scholarship in the Humanities, 2015
The Web as Corpus: Theory and Practice. Maristella Gatto. London/New York: Bloomsbury, 2014. xxii + 232 pp. ISBN 978-14-411-6112-3. $42.95 (paperback). The Web as Corpus: Theory and Practice is a timely and thorough introduction to the promising field of ‘Web as Corpus’ (hereafter WaC) at a time when exponentially cumulating online language use has ...

GlossaNet

Lingvisticae Investigationes, 1999
GlossaNet is an automated system that monitors Web sites. On dates and at intervals selected by the user, GlossaNet downloads the Web site, converts it to an electronic corpus and uses the INTEX programs (M. Silberztein 1993) and the linguistic resources of the LADL (electronic dictionaries and libraries of local grammars) to parse it.
