Blog

How To Execute Python Modules From Java.

This tutorial is based on macOSX with an M1 chip for Python 3.8/3.9 and Java 11. Intro. From my practice, one of the last cases I met was the task when the base application engine was in Java, and I was supposed to add a new Machine Learning feature. The key challenge here was that the…

Deploy an NLP pipeline. Flask+Heroku+Bert.

A simple, quick solution to deploy an NLP project and challenges you may face during the process. Intro. Very often, as a data scientist, you may be faced with a task that includes a complete pipeline: from the data collection up to deploy the app on the server. I bumped into such an odd job…

Hugging Face DistilBert & Tensorflow for Custom Text Classification.

How to fine-tune DistilBERT for text binary classification via Hugging Face API for TensorFlow. Intro. In this tutorial, you will see a binary text classification implementation with the Transfer Learning technique. For this purpose, we will use the DistilBert, a pre-trained model from the Hugging Face Transformers library and its API for Tensorflow. Why DistilBert.…

NLP: Text Pre-processing and Feature Engineering. Python.

Pre-process text data, create new features (including target variable for binary classification) with Python: Numpy, Pandas, Regex, Spacy, and Tensorflow. Intro. Data pre-processing is a fundamental part of data scientist work. Apart from data collecting, it is one of the principal stages. On it depends our future model’s quality and accuracy. The better we clean/prepare…

Web Scraping With No Effort. Python: BeautifulSoup, Grequests.

How to build a web scraper with BeautifulSoup and asynchronous HTTP requests (Grequests) Introduction. It is my first tutorial about web scraping. I will explain (with full code examples) how to create a web scraper using BeautifulSoup and Grequests Python libraries. Assuming you have an NLP task — collect text data from the recipe website and make…

Python Selenium: Scraping A Recipe Website

Data science insights while you scrapping a platform for data science practice. Introduction. This time I will explain (with full code examples) how to create a web scraper in eight steps using the Selenium Python framework. I will take a recipe site https://www.simplyrecipes.com/. The subject of this post can be a base part of any…

Python 2 vs 3 in 20 minute talk.

The fascinating quest of figuring out the difference between. Let’s start with the background of this race for the truth. In the first week of our boot camp training (winter 2020), we got the first team task to make a presentation “within 20 minutes” on one of 12 topics. No one attached much importance to…


Follow My Blog

Get new content delivered directly to your inbox.