[ad_1]
How to know the unknowable in observational studies Introduction Problem Setup 2.1. Causal Graph 2.2. Model With and Without Z 2.3. Strength of Z as a Confounder Sensitivity Analysis 3.1. Goal 3.2. Robustness Value PySensemakr Conclusion Acknowledgements References The specter of unobserved confounding (aka omitted variable bias) is a notorious problem in observational studies.…
[ad_1]
Automation, machine learning and LLMs in the chip industry (source: chatGPT)I felt like one of those guys from Monsters Inc. You know, the ones in the big yellow hazmat suits. A necessary precaution! I was entering the most complex manufacturing environment in the world. One that requires so much precision that even microscopic particulates…
[ad_1]
How to Create a Speech-to-Text-to-Speech Program Image by Mariia Shalabaieva from unsplashIt’s been exactly a decade since I started attending GeekCon (yes, a geeks’ conference 🙂) — a weekend-long hackathon-makeathon in which all projects must be useless and just-for-fun, and this year there was an exciting twist: all projects were required to incorporate some…
[ad_1]
Understand Semantic Structures with Transformers and Topic Modeling We live in the age of big data. At this point it’s become a cliche to say that data is the oil of the 21st century but it really is so. Data collection practices have resulted in huge piles of data in just about everyone’s hands.…
[ad_1]
A handy reference on migrating bookmarks, terminal enhancements, and AWS Cli settings Image generated by author using midjourneyI recently received a new 16-inch MacBook Pro with the latest Apple M3 chip for my work computer. I had heard rave reviews about the blazing-fast Apple M1 and M2 chips, so I was incredibly excited to…
[ad_1]
Advanced techniques to process and load data efficiently AI-generated image using KandinskyIn this story, I would like to talk about things I like about Pandas and use often in ETL applications I write to process data. We will touch on exploratory data analysis, data cleansing and data frame transformations. I will demonstrate some of…
[ad_1]
Build your own book recommender with CatBoost Ranker 14 min read · 16 hours ago In today’s digital world, where information overload and wide product offer is the norm, being able to help customers find what they need and like can be an important factor to make our company stand out…
[ad_1]
How to build a modern, scalable data platform to power your analytics and data science projects (updated) Table of Contents: What’s changed? Since 2021, maybe a better question is what HASN’T changed? Stepping out of the shadow of COVID, our society has grappled with a myriad of challenges — political and social turbulence, fluctuating…
[ad_1]
From a probability density function to random samples Photo by Moritz Kindler on UnsplashT here are different methods for updating a reinforcement learning agent’s policy at each iteration. A few weeks ago we started experimenting with replacing our current method with a Bayesian inference step. Some of the data workloads within our agent are…
[ad_1]
In 2022, my portfolio helped me get my first DS job. Now I’m tearing it down and starting again from scratch Image by David Pisnoy on UnsplashIf you’re a data scientist or aspiring data scientist, keeping an online portfolio is a fantastic way to showcase your skills to prospective employers. I made my first…