I. Bibliography





Date: 09.06.2018

Lace: Greek OCR 

Overview


This site catalogues the results of our 2012/13 campaign to produce high-quality OCR of polytonic, or 'ancient', Greek texts in an HPC environment. It comprises over 600 volumes from archive.org and from original scans. There are over 6 million pages of OCR output in total, including experimental and rejected results.

Results are presented in a hierarchical organization, beginning with the archive.org volume identifier. Each identifier is associated with one or more 'runs', or attempts at OCRing that volume. A run has a date stamp and is associated with a classifier and an aggregate best b-score (a rough indicator of the quality of the Greek output). Each run produces various kinds of output:



  1. raw hocr output: the data generated by our OCR process, usually with multiple copies for each page, rendered at a range of binarization thresholds

  2. selected hocr output: a filtered version of the data in (1), with each page image represented by a single, best, output page

  3. blended hocr output: the data in (2), with a word replaced by the corresponding word from the raw output in (1) whenever the word on the selected page is not a dictionary word and one of the raw pages offers one.

  4. selected hocr output spellchecked: the data in (3) processed through a weighted Levenshtein distance spellchecking algorithm that is meant to correct simple OCR errors

  5. combined hocr output: where archive.org provides OCR output for Latin script (not Greek), this final step pieces together the data in (4) with archive's output, preferring archive's output where our output suggests that the data is Latin. If archive.org provides Greek output, this step is no different from (4)
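Steps (2) and (3) can be illustrated with a minimal Python sketch. It is not the Rigaudon code: the `PageOutput` class, the function names, and the assumption that words line up by position across renderings are all hypothetical simplifications (real hOCR output would more plausibly be aligned by word bounding boxes), and the b-score is treated simply as "higher is better".

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class PageOutput:
    """One OCR rendering of a page (hypothetical structure, for illustration)."""
    threshold: int       # binarization threshold used for this rendering
    words: list[str]     # recognized words, in reading order
    b_score: float       # rough quality score of the Greek output


def select_best(raw_pages: list[PageOutput]) -> PageOutput:
    """Step (2): keep the single highest-scoring rendering of a page."""
    return max(raw_pages, key=lambda page: page.b_score)


def blend(selected: PageOutput, raw_pages: list[PageOutput],
          dictionary: set[str]) -> list[str]:
    """Step (3): where the selected word is not a dictionary word, take a
    dictionary word found at the same position in another rendering."""
    blended = []
    for i, word in enumerate(selected.words):
        if word not in dictionary:
            for page in raw_pages:
                # Assumption: word i in one rendering corresponds to word i
                # in every other rendering of the same page.
                if i < len(page.words) and page.words[i] in dictionary:
                    word = page.words[i]
                    break
        blended.append(word)
    return blended
```

Given two renderings of one page, `select_best` picks the one with the higher score, and `blend` repairs only those positions where the selected rendering has a non-dictionary word and another rendering has a dictionary word.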

Code


All code and classifiers for Rigaudon are posted in a GitHub repository. This holds the modified Gamera source code, ancillary Python scripts such as the spellcheck engine, and the Bash scripts that coordinate the process in an HPC environment through Sun Grid Engine.
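As a rough illustration of what a weighted Levenshtein spellchecker does, the sketch below computes an edit distance in which selected character substitutions are cheaper than others, then corrects a word to its nearest dictionary entry. The function names, the cost table, and the `max_distance` cut-off are assumptions made for this example; the weights and logic of the actual spellcheck engine in the repository are not reproduced here.

```python
def weighted_levenshtein(source, target, sub_cost=None,
                         default_sub=1.0, indel=1.0):
    """Edit distance where substituting (a, b) costs sub_cost[(a, b)]
    if listed, otherwise default_sub; insertions and deletions cost indel."""
    sub_cost = sub_cost or {}
    # prev[j] holds the distance between the processed prefix of source
    # and the first j characters of target.
    prev = [j * indel for j in range(len(target) + 1)]
    for i, s in enumerate(source, start=1):
        curr = [i * indel]
        for j, t in enumerate(target, start=1):
            cost = 0.0 if s == t else sub_cost.get((s, t), default_sub)
            curr.append(min(prev[j] + indel,       # delete s
                            curr[j - 1] + indel,   # insert t
                            prev[j - 1] + cost))   # substitute s -> t
        prev = curr
    return prev[-1]


def correct(word, dictionary, sub_cost=None, max_distance=1.0):
    """Return the closest dictionary word if it is within max_distance,
    otherwise return the word unchanged."""
    if not dictionary or word in dictionary:
        return word
    best = min(dictionary,
               key=lambda cand: weighted_levenshtein(word, cand, sub_cost))
    if weighted_levenshtein(word, best, sub_cost) <= max_distance:
        return best
    return word
```

Giving a low cost to substitutions that an OCR engine commonly confuses lets such errors be corrected while unrelated words, which remain far from every dictionary entry, are left alone.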

Details of its operation are outlined in a white paper.



Our July 2013 presentation at the London Digital Classicist seminar series is available online from the Institute of Classical Studies.


L'Antiquité Classique
L'Antiquité Classique is an internationally renowned annual journal specializing in Greek and Roman antiquity (from the pre-Hellenic period to late antiquity, or to aspects of the Renaissance connected with classical studies). Supported by the Fondation universitaire de Belgique and the Fonds de la Recherche scientifique (FNRS), the journal publishes original contributions in the usual languages of scholarship (English, French, German, Italian, Spanish…), each of which is first submitted to a review committee that includes international experts.

Available periods:

1932-1939


  • 1932 : [ 1-1-2 ]

  • 1933 : [ 2-1 ]

  • 1934 : [ 3-1 ]

  • 1937 : [ 6-1 ]

  • 1938 : [ 7-1 ] [ 7-2 ]

  • 1939 : [ 8-1 ]

1940-1949


  • 1940 : [ 9-1 ]

  • 1941 : [ 10-1 ]

  • 1942 : [ 11-2 ]

  • 1943 : [ 12-1 ]

  • 1944 : [ 13-1 ]

  • 1945 : [ 14-1 ] [ 14-2 ]

  • 1946 : [ 15-1 ] [ 15-2 ]

  • 1947 : [ 16-1 ] [ 16-2 ]

  • 1949 : [ 18-1 ] [ 18-2 ]

1950-1959


  • 1950 : [ 19-1 ]

  • 1951 : [ 20-1 ] [ 20-2 ]

  • 1952 : [ 21-1 ] [ 21-2 ]

  • 1953 : [ 22-1 ] [ 22-2 ]

  • 1954 : [ 23-1 ] [ 23-2 ]

  • 1955 : [ 24-1 ] [ 24-2 ]

  • 1956 : [ 25-1 ] [ 25-2 ]

  • 1957 : [ 26-2 ]

  • 1958 : [ 27-1 ]

1960-1969


  • 1961 : [ 30-2 ]

  • 1962 : [ 31-1-2 ]

  • 1963 : [ 32-1 ] [ 32-2 ]

  • 1964 : [ 33-1 ]

  • 1965 : [ 34-2 ]

  • 1966 : [ 35-1 ] [ 35-2 ] [ Suppl ]

  • 1967 : [ 36-1 ]

  • 1968 : [ 37-2 ]

  • 1969 : [ 38-1 ]

1970-1979


  • 1970 : [ 39-1 ] [ 39-2 ]

  • 1971 : [ 40-1 ] [ 40-2 ]

  • 1972 : [ 41-1 ] [ 41-2 ]

  • 1973 : [ 42-1 ]

  • 1975 : [ 44-2 ]

  • 1976 : [ 45-1 ] [ 45-2 ]

  • 1977 : [ 46-1 ] [ 46-2 ]

  • 1978 : [ 47-1 ] [ 47-2 ]

  • 1979 : [ 48-1 ] [ 48-2 ]

1980-1989


  • 1980 : [ 49 ]

  • 1982 : [ 51 ]

  • 1983 : [ 52 ]

  • 1984 : [ 53 ]

  • 1986 : [ 55 ]

  • 1988 : [ 57 ]

1990-1999


  • 1994 : [ 63 ]

  • 1995 : [ 64 ]

  • 1996 : [ 65 ]

2000-2007


  • 2000 : [ 69 ]

  • 2002 : [ 71 ]

  • 2003 : [ 72 ]

  • 2004 : [ 73 ]

  • 2005 : [ 74 ]





Acta Classica: Proceedings of the Classical Association of South Africa
ISSN 0065-1141



Acta Classica (ISSN 0065-1141) publishes articles (536), notes (162), and reviews (107). The language of publication is mainly English (650), but many contributions have also been written in Afrikaans (72), German (62), French (11), Dutch (9), Latin (5), and Italian (2). 

 
Acta Classica is an international journal. It has published work by scholars residing in South Africa (550), the United States of America (69), the United Kingdom of Great Britain (38), Canada (38), Australia (35), Germany (26), The Netherlands (13), Rhodesia and Nyasaland / Zimbabwe / Tanzania (11), Belgium (5), New Zealand (4), Italy (4), Israel (3), Poland (2), Greece (2), France (2), and Japan (1).

The journal publishes work in all fields of Classics, from textual criticism (37) to the Classical Tradition / Reception Studies (17). Many contributions have been made in the field of Ancient History (approximately 188), but the majority have been literary in nature (305). Further contributions have been made in the field of Ancient Philosophy (42) and Ancient Religion (14). Some interesting work has also been done in the history of Classical Scholarship (52), including the work of South African Classics scholars, as well as in Lexicography (19), Epigraphy (12), Art (10), and Archaeology (2). There have also been articles in such diverse areas of study as Research Methodology in Classics (3) and Byzantine / Medieval Studies (18).

The longest article published in the journal, written in German, runs to over fifty pages and the shortest to just five; on average, articles are thirteen to fifteen pages long.


Users of EndNote may want to download the Acta Classica EndNote style (ActaClassica.ens) and the compressed data files for work published in the journal (ActaClassica.enlx) in order to search for articles, notes, and reviews using this bibliographical package.

All articles from Volume 49 (2006) to Volume 1 (1958) are available in open access from this site.






Vol 57 (2014)

Vol 56 (2013)

Vol 55 (2012)

Vol 54 (2011)

Vol 53 (2010)

Vol 52 (2009)

Vol 51 (2008)

Vol 50 (2007)

Vol 49 (2006)

Vol 48 (2005)

Vol 47 (2004)

Vol 46 (2003)

Vol 45 (2002)

Vol 44 (2001)

Vol 43 (2000)

Vol 42 (1999)

Vol 41 (1998)

Vol 40 (1997)

Vol 39 (1996)

Vol 38 (1995)

Vol 37 (1994)

Vol 36 (1993)

Vol 35 (1992)

Vol 34 (1991)

Vol 33 (1990)

Vol 32 (1989)

Vol 31 (1988)

Vol 30 (1987)

Vol 29 (1986)

Vol 28 (1985)

Vol 27 (1984)

Vol 26 (1983)

Vol 25 (1982)

Vol 24 (1981)

Vol 23 (1980)

Vol 22 (1979)

Vol 21 (1978)

Vol 20 (1977)

Vol 19 (1976)

Vol 18 (1975)

Vol 17 (1974)

Vol 16 (1973)

Vol 15 (1972)

Vol 14 (1971)

Vol 13 (1970)

Vol 12 (1969)

Vol 11 (1968)

Vol 10 (1967)

Vol 9 (1966)

Vol 8 (1965)

Vol 7 (1964)

Vol 6 (1963)

Vol 5 (1962)

Vol 4 (1961)

Vol 3 (1960)

Vol 2 (1959)

Vol 1 (1958)









  • BIBLE, JUDAISM, CHRISTIANITY via Google Book Search

  • AGSL Digital Photo Archive - Asia and Middle East

  • Open Access Urkunden des aegyptischen Altertums




