Mastering the Tech Interview: A Comprehensive Guide to Success Throughout my career, I have gained extensive experience in technical interviews, having been on both sides of the process – as an interviewer and an interviewee. I have had the privilege of interviewing
ACL 2020 Highlights: Interpretability, Evaluation and more. This post discusses highlights of the main conference of the 2020 Annual Meeting of the Association for Computational Linguistics (ACL 2020). The conference accepted 779 papers with an acceptance rate of 22.7%, had
ACL 2019 Highlights This post discusses highlights of the main conference of the 2019 Annual Meeting of the Association for Computational Linguistics (ACL 2019). Note that these notes are written with business applications in mind.
mcQA - Multiple Choice Question Answering mcQA is a multiple choice question answering python library, using Language Models.
EMNLP 2018 Highlights In this post, I share my notes from the conference on Empirical Methods for Natural Language Processing, which took place in Brussels, Belgium, from October 31th to November 4th 2018. The tutorials, workshops
Building Parallel Corpora Using Cross-Lingual BOW Training machine translation models requires a huge amount of parallel data. Consequently, there has been many works suggesting different methods to build bilingual corpora, leading to the construction of reliable training datasets for
Clause Augmentation for Better NMT Most public parallel corpora are formed of long sentences. Consequently, neural translation models tend to generate a long output with n-grams repetition, even when they are exposed to a short sequence or a