Writing papers using Latex – Part 2

crop female designer drawing on paper sheet

After you have gone through the gruesome as well as the exciting task of completing your experiments and getting somewhat positive results for your research publication(maybe even your first). The next step is to prepare it for paper submission at a conference or journal. By now, you should have a working Latex environment and the … Read more

Writing papers using Latex – Part 1

a woman studying inside the library

After you have gone through the gruesome as well as the exciting task of completing your experiments and getting somewhat positive results for your research publication(maybe even your first). The next step is to prepare it for paper submission at a conference or journal. Most conferences will follow the proceedings template of either ACM (generally, … Read more

Working with Imbalanced Data sets

Theory Imbalanced data sets, in the context of supervised classification problems, refer to the case when the class distribution is highly skewed or disproportionate. Since general supervised learning algorithms assume them to be balanced, they perform accuracy maximization. However, this, in turn, will propagate a model bias and be addressed to some extent, when we … Read more

Exploratory Analysis using R

This is a good post on making visualisations with pandas data frame in Python. It covers uni-variate plots like histograms, line plots, density plots and multivariate plots like correlation plot matrix and scatter-plot matrix. Before diving into feature engineering and data cleaning, it is a good idea to have a good understanding of the data. … Read more

Top 10 Python packages for mastering Data Science

The significance of learning Python for Data Science Python has turned out to be one of the commonly used programming languages for Machine Learning as well as in Data Science in general. It has become even more prominent after the rise of Deep Learning and use of GPUs. Python boasts of robust and widely used … Read more

Python for Natural Language Processing with Pandas and NLTK

Python is the most popular programming language for Machine Learning and Natural Language Processing (NLP). Its compact nature along with an enormous collection of packages like pandas, nltk, and deep learning frameworks like Pytorch, and Tensorflow, have made it a go-to language for data science enthusiasts and programming newbies, in general. The corresponding IPython notebook … Read more

Starting with R (Windows, Linux, Mac)

Last updated on 20th October, 2019 Article summary Here, we first talk about the R programming language, followed by the installation steps for getting started with R and its IDE – RStudio. What is R? R is an useful language to learn for Data Science. I generally use it while exploring the data and the … Read more

How to prepare a technical presentation with googleslides

Presenting your research work or weekly progress to a large audience or your supervisor is an important skill to learn. Google slides (googleslides) have been a go-to for delivering a technical presentation nowadays.  This article will cover a step-by-step method that I use to prepare the first draft of my technical presentations in a short … Read more