This repository contains all the sorce code of Python for Data Science book, in the form of Jupyter notebooks. The book, based on the author's teaching experience, aims to introduce the fundamental concepts of Python with applications in data science.
- Introduction
- Python environment
- Python basics
- Python coding
- Main Python libraries
- Data manipulation
- Exploratory Data Analysis
- Data preparation
The book introduces the fundamental concepts of Python with applications in data science. It offers a compact, self-contained, and accessible collection of methodologies and practical examples designed to support learning. The text is intended for undergraduate and graduate students who may not have a prior programming experience.
The book was written and tested with Python 3.12, though other Python 3.x versions should work in nearly all cases.
If you have Git installed, you can clone the book repository, otherwise simply download it and move to the correct folder:
git clone https://github.com/mscarpiniti/PyDSBook.git
cd PyDSBook
To use the code, it is suggested to create a new Python 3.12 environment (i.e., PyDS) with Spyder and pip:
conda create -n PyDS python=3.12 Spyder pip
Then activate the environment and install all requirements:
conda activate PyDS
pip install -r requirements.txt
Open this repository in Colaboratory:
The code in this repository, including all code samples in the notebooks listed above, is released under the Apache License, Version 2.0. Read more at the Open Source Initiative.
