Matthias Boenig

Matthias Boenig

@tboenig

Project Developer / Digitizer / XML Enthusiast

Berlin
15
Followers
16
Following
104
Public Repos
0
Private Repos

Language Breakdown

Lines of code distribution across 30 owned repositories

389K Total LOC
HTML
200,833 lines
51.6%
N/A
XSLT
166,150 lines
42.7%
N/A
CSS
15,711 lines
4.0%
N/A
Python
5,103 lines
1.3%
N/A
Shell
1,302 lines
0.3%
N/A
T

T-Shaped Developer

T-shaped

Deep in HTML with broad versatility

HTML
XSLT
CSS
Python
Shell

Collaboration Network

Global Impact visualization

LIVE
Matthias Boenig
0 active collaborators

Repos

141

PRs

0

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

1 day
304
Contributions
38
Commits
0
Pull Requests
Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
Mo
We
Fr
Based on GitHub activity
Less
More

Top Repositories

page2page

This repository save the stylesheet and workaround for transforming the properitary PAGE XML file from Transkribus (https://transkribus.eu/Transkribus) into a PAGE XML valid format (https://www.primaresearch.org/schema/PAGE/gts/pagecontent/ newest version from 2019-07-16

3 2
XSLT
gt_corpus_benchmark

This repo provides a collection of ground truth data. The collection was compiled under different aspects (complexity of the layouts and use of the fonts). The individual data are also characterized by metadata. The metadata is based on the labeling scheme of OCR-D/PrimaLab.

2 0
digi-gt

Ground truth for the digitized historic collections of UB Mannheim

2 0
gt-guidelines

OCR-D guidelines for Ground Truth production

2 1
XSLT
AletheiaTools

AletheiaTools is a collection of tools for transforming file formats (PAGE XML) and metadata formats (METS). It is a kind of Ground Truth Swiss Knife ;-)

2 0
page2tei
2 2
XSLT
mets2exif

METS2EXIF – Automatische METS-Metadaten-Extraktion und Einbettung in die Bild-Datei

1 0
Python
Korpusbildung_Workshop

Die DHd-AG Zeitungen & Zeitschriften bietet am 11. und 12. November 2021 jeweils von 9:00-13:00 Uhr einen virtuellen Workshop an, um anhand digitaler Zeitungs- und Zeitschriftenbestände zu zeigen, wie die für viele Forschende notwendige, individuelle Korpusbildung in Zeitungsportalen selbst sowie mittels NLP-Methoden unterstützt werden kann.

1 0
Jupyter Notebook
gt-fraktur
1 0
choco-mufin

Tools for normalizing the use of some characters and checking file consistencies

1 0
Python

Open Source Impact

Contributions to external projects

13 merged PRs
Contributed to 3 repositories