InTDS Archiveby💡Mike ShakhomirovAdvanced SQL for Data ScienceExpert techniques to elevate your analysisAug 24, 20247773Aug 24, 20247773
InData Engineer ThingsbyVu TrinhI spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.Aug 24, 20242.3K23Aug 24, 20242.3K23
Patrick Cuba1. Data Vault and Domain Driven Design“It is not the domain experts’ knowledge that goes to production, it is the assumption of the developers.” — Alberto BrandoliniSep 5, 20221161Sep 5, 20221161
Patrick CubaData Vault on Snowflake: Performance Tuning with KeysSnowflake continues to set the standard for Data in the Cloud by taking away the need to perform maintenance tasks on your data platform…Jul 19, 202318Jul 19, 202318
Shawn TngAutomated Scoring For Tiktok Dance ChallengeThis is a continuation of my previous post, where Google’s Mediapipe was used for multi-person pose estimation. From each video frame, we…May 20, 20213May 20, 20213
InBootcampbyJosh Cottrell-SchloemerExcel is your most overlooked design toolA designer’s perspective on the world’s #1 spreadsheet tool — how to build infographics, dashboards, presentations & moreApr 19, 202260412Apr 19, 202260412
Saumil MehtaWhy You’re Paid What You’re Paid — Five Key Tech Compensation TakeawaysWhat do you make every year? What kinds of raises have you gotten over the last few years? How are you valuing cash vs. equity? How much…Apr 2, 202284013Apr 2, 202284013
InTDS ArchivebyKhuyen TranGreat Expectations: Always Know What to Expect From Your DataEnsure Your Data Works as Expected Using PythonOct 8, 20214113Oct 8, 20214113
InTDS ArchivebyJosh TaylorFuzzy matching at scaleFrom 3.7 hours to 0.2 seconds. How to perform intelligent string matching in a way that can scale to even the biggest data sets.Jul 1, 20191.4K17Jul 1, 20191.4K17
InTDS ArchivebyNaim KabirJinja + SQL = ❤️Macros for maintainable, testable data analyticsAug 15, 20212982Aug 15, 20212982
Kosma FuławkaWhat I’ve learned setting up 12 Databricks environmentsData Engineering in practice. Preparing an enterprise grade environment in a huge organization is not a piece of cake. In fact, it is quite…Nov 19, 2021891Nov 19, 2021891
InBluecore EngineeringbyJessica LaughlinWe’re All Using Airflow Wrong and How to Fix ItTl;dr: only use Kubernetes OperatorsAug 3, 20184.5K55Aug 3, 20184.5K55
Ruurtjan PulUnderstanding Kafka with FactorioWhile playing Factorio the other day, I was struck by the many similarities with Apache Kafka.Apr 27, 20191.3K6Apr 27, 20191.3K6
InThe Airbnb Tech BlogbyVaughn QuossData Quality at AirbnbPart 2 — A New Gold StandardNov 24, 20201.1K6Nov 24, 20201.1K6
Patrick BaconWhy do Aggressive Defensemen Experience Sharp Declines at Young Ages?Alex Pietrangelo’s rough start is an excellent case study of a broader concept.Nov 4, 2021664Nov 4, 2021664
David B.Old: Introducing a v2 - version of org-chart libCheckout v3 org chart introduction hereAug 14, 20213728Aug 14, 20213728
InTDS ArchivebySara A. Metwalli5 Games That Can Help You Improve Your Skills As a Data ScientistImprove your skills and have fun at the same time.Sep 19, 20214432Sep 19, 20214432
InTDS ArchivebyPaul Singman3 Data Lake Anti-Patterns to AvoidRid yourself of these troubling habits and start the journey towards data lake mastery!Mar 30, 20211631Mar 30, 20211631
InFT Product & TechnologybyMihail PetkovFinancial Times Data Platform: From zero to heroAn in-depth walkthrough of the evolution of our Data PlatformDec 2, 20201.2K9Dec 2, 20201.2K9
InCodeXbyRudderStackThe Future of Data Pipeline Tools Must Include Better Transformations Than ETL Ever HadEverybody hates data transformations in their pipeline tools. Developers hated the transformations in ETL tools so much that they came up…Jul 12, 2021473Jul 12, 2021473