Last week I attended the AWS Summit Johannesburg. This was the first summit to be hosted in my own country and my own ci...
Spark Analysers: Catching Anti-Patterns In Spark Apps
Introduction Apache Spark™ is a widely used open source distributed computing engine. It is one of the main components o...
Identifying Green Vehicles for a Zero-Emission Future
Introduction Uber has made a public commitment to phase out carbon emissions in the United States, Canada, and Europe by...
Risk Entity Watch – Using Anomaly Detection to Fight Fraud
Background At its core, Uber operates marketplaces. With our Rides business, the two sides of the marketplace are driver...
Amazon Bedrock Is Now Generally Available – Build and Scale Generative AI Applications with Foundation Models
This April, we announced Amazon Bedrock as part of a set of new tools for building with generative AI on AWS. Amazon Bed...
Amazon MSK Introduces Managed Data Delivery from Apache Kafka to Your Data Lake
I’m excited to announce today a new capability of Amazon Managed Streaming for Apache Kafka (Amazon MSK) that allows you...
Automated Audit Framework For Internet Scale Financial Transactions
Introduction Uber, like most companies that charge customers for services, needs to record the financial interactions wi...
AWS Weekly Roundup: Amazon EC2 M2 Pro Mac, Amazon Coretto 21, Amazon CloudWatch Synthetics, and more (Sept. 25, 2023)
This week, I’m in Jakarta to support AWS User Group Indonesia and AWS Cloud Day Indonesia. Yesterday, I attended a commu...
Announcing Cadence 1.0: The Powerful Workflow Platform Built for Scale and Reliability
Introduction We’re thrilled to announce the long-awaited release of Cadence v1.0.0, a major milestone for our team and p...
Fast Copy-On-Write within Apache Parquet for Data Lakehouse ACID Upserts
Overview With the evolution of storage table formats Apache Hudi®, Apache Iceberg®, and Delta Lake™, more and more...