experiments,

Things I learned experimenting during Thanksgiving week

Gerald Haslhofer Gerald Haslhofer Follow Nov 27, 2020 · 1 min read
Things I learned experimenting during Thanksgiving week
Share this
  • Taking a screenshot from Windows in the right resolution and scale factor
  • Using Tesseract OCR engine
  • spaCy NLP python library. Demo with spacy
  • Powershell scripts and how to launch them (to access Windows.Media.OCR capabilities)
  • Using Google cloud for text classification
  • Creating a Flask webserver hosting python code to expose capabilities to a local c# app (instead of always launching the python exe given that’s too slow)
  • Authenticating using MSAL to access resources in MS Graph
  • Adding pages to OneNote (including inline base64 encoded images embedded in HTML to avoid having to use multi-part messages which never worked)
  • Creating c# structure based on Json payload. source

Future:

Extract summaries:

  • https://pypi.org/project/bert-extractive-summarizer/

Layout analysis:

  • https://github.com/Layout-Parser/layout-parser/blob/master/examples/Deep%20Layout%20Parsing.ipynb
    • based on Facebook’s Detectron framework
    • needs MS c++ 14+
  • https://dhsegment.readthedocs.io/en/latest/start/annotating.html
  • https://huggingface.co/transformers/model_doc/layoutlm.html based on https://arxiv.org/pdf/1912.13318v5.pdf

Other sources and relative importance:

  • annotated screens: https://github.com/ibm-aur-nlp/PubLayNet
  • projects about document layout analysis : https://github.com/topics/document-layout-analysis