Join the R Community at ShinyConf 2023

Introduction to Data Science with Python

We’re happy to announce a new data science in Python course by Appsilon has been released and is available to everyone for free, as open-source material!

Check out the course repository on GitHub

The course is a great way to introduce yourself to the Data Science world. It does require you to know the basics of Python. It dives deeper than one might expect in the beginning and the benefits of this are seen immediately in the next lecture. Each lecture has accompanying homework with example solutions.

Intro to Data Science with Python Course outline

1. Introduction to the Data Science World 

  • Explanation of various terms like ML/DS/AI/DNN
  • Modern environment setup
  • The Notebook way of writing code

2. Numerical Operations in Python 

  • Introduction to the numpy library

3. Overview of Different Data Formats Used in the Data Science

  • Deep dive into dataframes using the pandas library

4. Basics of Data Visualization in Python 

5. Machine Learning for Beginners Using scikit-learn Library

6. Reporting, Deployment, and Data Presentation 

  • Multiple examples of shareable dashboards in streamlit
  • Elegant and automatic reports in quarto
  • RESTful APIs in fastapi

Prepping for a data science interview? Practice these top 10 Data Science SQL interview questions.

Intro to Data Science in Python Course overview

In this course, we use tools that can be easily leveraged in RStudio (Posit) Connect to work and share your data science products. RStudio Connect is an excellent way for Data Scientists to use their preferred tools and share insights with teams/clients in one, easy-to-use location. You can host Jupyter Notebooks, Streamlit apps, Quarto reports, fastapi APIs, and more! Deliver key insights to decision-makers, at the right time, in the right format, without sacrificing your favorite tools.

You most likely won’t need Connect if you’re an individual starting out, but if you work in a team or need enterprise solutions – we highly recommend it. 

Get our complete guide to gradient boosting and XGBoost in R.

The course also tries to sneak in some more advanced libraries like numba, polars, and xgboost by using convincing use cases. We encourage you to explore these further.

Who is this Data Science course for?

This course is for those with some previous knowledge of Python; particularly those who are interested in applying Python for data science. We’ll show tricks and tips useful for STEM/economic students, but these are also applicable to a wide array of subjects. 

Ready for more? Try these top 5 data science take-home challenges in R and see if you can translate them to Python!

The secondary goal of the course is to show participants how to use free tools that are industry standards. We will avoid proprietary software like Matlab/Statistica/SAS. 

Course license

The course has been prepared by Piotr Pasza Storożenko from Appsilon. It is available under CC BY 4.0 license. Feel free to use these materials for your use and courses; please attribute the original author.

Why is Appsilon sharing a course for Data Science in Python?

As contributors to R open source, Appsilon believes in the flexibility and utility of open-source materials. We’ve contributed several R packages that make R shiny apps more beautiful, dynamic, and secure. We do this to benefit the R community and improve data science tooling for all users. 

Get the most out of your Machine Learning and Data Science projects with PyTorch Lightning and Hydra.

Appsilon develops and scales Shiny apps for Fortune 500 companies, NGOs, research institutions, government agencies, and more. We’ve scaled Shiny apps for enterprise customers that are capable of thousands of simultaneous users and can operate on a distributed infrastructure. In the process we’ve demonstrated the impressive nature of R and Shiny for data science storytellers; we made several of these free-use Shiny Templates

And with Shiny for Python, we look forward to evangelizing Posit’s technology in the Python community. If you’re interested in trying out Shiny for Python, we’ve provided a short tutorial on how to get started

We hope to share the power of Shiny with Data Scientists in both R and Python camps. We hope this course helps users get started on their data science journey and encourages new insights using Shiny!