InLevel Up CodingbyYousry MohamedDelta Lake Liquid Clustering — A visual explanationHow to optimize lakehouse data storage layout with minimal effort.Jan 28, 20243Jan 28, 20243
Steve RussoUse Rust to Write Spark AppsUntil Spark 3.4, developing and deploying a Spark application was sometimes a big hassle. Getting Spark running locally for development…Jul 3, 2024Jul 3, 2024
InDBSQL SME EngineeringbyDatabricks SQL SMEUnderstanding Data Access Patterns with Unity Catalog LineageAuthor: Peter Davis, Sr. Solutions Architect @ DatabricksJul 11, 20242Jul 11, 20242
InThe PayPal Technology BlogbyIlay ChenLeveraging Spark 3 and NVIDIA’s GPUs to Reduce Cloud Cost by up to 70% for Big Data PipelinesHow PayPal achieved a remarkable cloud cost reduction through strategic GPU utilizationFeb 21, 20247Feb 21, 20247
InNetflix TechBlogbyNetflix Technology BlogSequential A/B Testing Keeps the World Streaming Netflix Part 1: Continuous DataMichael Lindon, Chris Sanden, Vache Shirikian, Yanjun Liu, Minal Mishra, Martin TingleyFeb 12, 20247Feb 12, 20247
InData Engineer ThingsbyHugo LuColumn lineage is out: AI is inWhy column-level lineage will rapidly be replaced by artificial intelligenceDec 19, 202310Dec 19, 202310
Mike CvetWhat is a Feed?What’s happening under the hood on your “For you” feed?Dec 11, 20234Dec 11, 20234
InGoPenAIbyTshepiso MogoswaneExploring Data Modelling with ChatGPTPart 1: Manual Experiments with ChatGPTMay 8, 20234May 8, 20234