atlas news
    
The Automatic
04  avril     16h13
Software Engineering for Data Scientists (New book )
Andrew Treadway    Very excited to announce the early access preview MEAP of my upcoming book, Software Engineering for Data Scientists is available now Check it out at this link. Use promo code au tre to save on this book and any products sold from Manning. Why Software Engineering for Data Scientists Data...
27  novembre     17h08
How to stop long-running code in Python
Andrew Treadway    Ever had long running code that you don’t know when it’s going to finish running If you have, then Python’s stopit library is for you. In a previous post, we talked about how to create a progress bar to monitor Python code. This post will show you how to automatically stop long running code with...
09  octobre     21h16
Faster alternatives to pandas
Andrew Treadway    Background If you’ve done any type of data analysis in Python, chances are you’ve probably used pandas. Though widely used in the data world, if you’ve run into space or computational issues with it, you’re not alone. This post discusses several faster alternatives to pandas. R’s data table in...
02  juillet     05h00
Automated EDA with Python
Andrew Treadway    In this post, we will investigate the pandas profiling and sweetviz packages, which can be used to speed up EDA exploratory data analysis with Python. In a previous article, we talked about an analagous package in R see this link . Getting started with pandas profiling pandas profiling can be...
28  avril     00h02
How to plot XGBoost trees in R
Andrew Treadway    In this post, we’re going to cover how to plot XGBoost trees in R. XGBoost is a very popular machine learning algorithm, which is frequently used in Kaggle competitions and has many practical use cases. Let’s start by loading the packages we’ll need. Note that plotting XGBoost trees requires the...
14  avril     22h26
Python collections tutorial
Andrew Treadway    In this post, we’ll discuss the underrated Python collections package, which is part of the standard library. Collections allows you to utilize several data structures beyond base Python. How to get a count of all the elements in a list One very useful function in collections is the Counter method,...
25  mars     00h32
How to create PDF files with Python
Andrew Treadway    In a previous article we talked about several ways to read PDF files with Python. This post will cover two packages used to create PDF files with Python, including pdfkit and ReportLab. Create PDF files with Python and pdfkit pdfkit was the first library I learned for creating PDF files. A nice...
03  mars     01h06
Faster data exploration with DataExplorer
Andrew Treadway    Data exploration is an important part of the modeling process. It can also take up a fair amount of time. The awesome DataExplorer package in R aims to make this process easier. To get started with DataExplorer, you’ll need to install it like below: install.packages DataExplorer Let’s use...
16  février     13h41
How to get stock earnings data with Python
Andrew Treadway    In this post, we’ll walk through a few examples for getting stock earnings data with Python. We will be using yahoo fin, which was recently updated. The latest version now includes functionality to easily pull earnings calendar information for individual stocks or dates. If you need to install...
02  février     13h43
Technical analysis with Python
Andrew Treadway    In this post, we will introduce how to do technical analysis with Python. Python has several libraries for performing technical analysis of investments. We’re going to compare three libraries ta, pandas ta, and bta lib. The ta library for technical analysis One of the nicest features of the ta...