Author profile

googlecloud

Developer articles and tutorials by googlecloud.

@googlecloudView on DEV

Articles by @googlecloud

Browse the latest writing surfaced through DevArt.

#gke#gcp#kubernetes

Inference on GKE Private Clusters

Setting up inference service without access to Internet Deploying an inference service on...

Maciej Strzelczyk5 min read

7 reactions0 commentsRead full article

#ai#gcp#vertexai#kubernetes

AI deployment: to host or not to host?

So you’ve built your AI application prototype. You used your own local GPU to run the AI model, or...

Maciej Strzelczyk5 min read

2 reactions0 commentsRead full article

#ai#testing#softwareengineering#promptengineering

Making Sure Your Prompt Will Be There For You When You Need It

At Google, our team (Google Cloud Samples) uses Gemini to produce thousands of samples in batches. In doing so, we’ve learned that the biggest hurdle isn’t the AI, it’s our own expectations about these tools.

Shawn Jones8 min read

5 reactions0 commentsRead full article

#ai#devops#productivity#softwareengineering

How My Team Aligns on Prompting for Production

My team at Google is automating sample code generation and maintenance. Part of that is using...

Adam Ross5 min read

19 reactions9 commentsRead full article

#googlecloud#node#devops#authentication

Why your `curl` logic just bit you 🐾

Authenticated in the CLI but still getting "Could not load default credentials"? Let's bridge the gap between gcloud and your application code.

Jennifer Davis3 min read

6 reactions0 commentsRead full article

#cloud#ai#programming

The lumberjack paradox: From theory to practice

Neal Sample called it the "Lumberjack Paradox": AI gives us a chainsaw, but we risk forgetting how to use the axe. In this post, I explore why code samples are the critical "line of representation" for modern engineering, and why fragmented documentation isn't just confusing developers—it's poisoning the AI models they rely on.

Jennifer Davis5 min read

12 reactions1 commentsRead full article

#security#googlecloud#nvidia#gpu

How to enable Secure Boot for your AI workloads

Written in cooperation with Aron Eidelman. As organizations race to deploy powerful GPU-accelerated...

Maciej Strzelczyk6 min read

0 reactions0 commentsRead full article

#cloud#googlecloud#tpu#gpu

Understanding Google Cloud’s Dynamic Workload Scheduler

In the age of artificial intelligence and machine learning, there is a constant need for powerful...

Maciej Strzelczyk7 min read

0 reactions0 commentsRead full article

#productivity#remote#gcp

Developing in the (Google) Cloud

As I entered the office today, it was clear that physical desktop computers are becoming a rarity....

Maciej Strzelczyk6 min read

11 reactions4 commentsRead full article

#googlecloud#observability#googlecloudnext#bigquery

Observability in Action: A Google Cloud Next demo

Quick run-down of one of the interactive demos that was presented at Next 2025, from the architecture to the products and features showcased.

Olivier Bourgeois4 min read

12 reactions0 commentsRead full article

#dockerfiles#web#axum#rust

Getting started with Rust on Google Cloud

This post will guide you through deploying a simple “Hello, World!” application on Cloud Run. You’ll...

Karl Weinmeister9 min read

0 reactions0 commentsRead full article

#ai#locallama#llm#googlecloud

Polish Large Language Model (PLLuM) on Google Cloud

"Wpadła śliwka w .... Google Cloud" 😉 Recently, thanks to the Ministry of Digital Affairs, there's...

Remigiusz Samborski1 min read

0 reactions0 commentsRead full article

#googlecloudplatform#gemini#machinelearning#ai

AI Appraiser: Discover the value of your items with Gemini on Google Cloud

While you were out shopping or cleaning up around the house, have you ever wondered what an item is...

Karl Weinmeister5 min read

0 reactions0 commentsRead full article

#machinelearning#googlecloudplatform#deepseek#ai

Attention Evolved: How Multi-Head Latent Attention Works

Compressing keys and values to reduce the cache size is MLA’s key innovation Attention is the...

Karl Weinmeister6 min read

0 reactions0 commentsRead full article

#kubernetes#ai#langchain#googlecloud

Streamline your LangChain deployments with LangServe

Learn how to streamline exposing AI models using LangChain and LangServe, deployed on Google Kubernetes Engine (GKE).

Olivier Bourgeois6 min read

19 reactions1 commentsRead full article

#kubernetes#ai#langchain#googlecloud

Leverage open models like Gemma 2 on GKE with LangChain

Learn how to leverage Google Kubernetes Engine (GKE) to deploy an AI-powered LangChain application backed by an local instance of Gemma 2.

Olivier Bourgeois7 min read

26 reactions3 commentsRead full article

#kubernetes#ai#langchain#googlecloud

Deploy Gemini-powered LangChain applications on GKE

Learn how to leverage Google Kubernetes Engine (GKE) to deploy an AI-powered LangChain application backed by Gemini.

Olivier Bourgeois5 min read

12 reactions1 commentsRead full article

#deno#typescript#fresh#javascript

Create a blog with Deno 2 and Fresh

Fresh is the most popular web framework built on Deno. With the imminent Deno 2.0 launch, now is a...

Tony Pujals1 min read

5 reactions0 commentsRead full article

#kubernetes#ai#langchain#googlecloud

Simplify development of AI-powered applications with LangChain

Let's learn what LangChain is, how it can help simplify development of AI-powered applications, and how to get started.

Olivier Bourgeois3 min read

10 reactions3 commentsRead full article

#productivity#automation#javascript#googlecloud

Write to Google Sheets from a local script via gcloud CLI authentication

Recently I needed to pull data from the GitHub API and publish to a Google Sheet so I could share...

Adam Ross4 min read

28 reactions6 commentsRead full article

#node#typescript#javascript

Built-in TypeScript Support with Node.js

Node.js 22.6.0 adds a new option for lightweight TypeScript support. What's nice about this is that...

Tony Pujals1 min read

2 reactions0 commentsRead full article

#node

Building Standalone Executables With Node.js

Node.js has experimental support for building a single executable application, or SEA, which is what...

Tony Pujals1 min read

5 reactions0 commentsRead full article

#go#programming#tooling

Command-Line Tools with Go: Piping Data

Unix is well-known for advocating the philosophy that commands should do one thing and do it...

Tony Pujals3 min read

8 reactions1 commentsRead full article

#ai#kubernetes#googlecloud#aiops

Intro to Ray on GKE

This post continues my AI exploration series with a look at the open source solution Ray and how it...

Kaslin Fields6 min read

26 reactions0 commentsRead full article

#productivity#webdev#github#tooling

Where to Host My Static Tech Blog

Context: I'm a decision record enthusiast and will absolutely write more about this in the future....

Adam Ross4 min read

7 reactions2 commentsRead full article

Text classification with Gemini and LangChain4j

Generative AI has potential applications far beyond chatbots and Retrieval Augmented Generation. For...

Guillaume Laforge8 min read

0 reactions0 commentsRead full article

Latest Gemini features support in LangChain4j 0.32.0

LangChain4j 0.32.0 was released yesterday, including my pull requestwith the support for lots of new...

Guillaume Laforge13 min read

0 reactions0 commentsRead full article

#machinelearning#googlecloudplatform#gemini#vertexai

Counting Gemini text tokens locally

The Vertex AI SDK for Python now offers local tokenization. This feature allows you to calculate...

Laurent Picard3 min read

3 reactions1 commentsRead full article

The power of embeddings: How numbers unlock the meaning of data

Prelude As I’m focusing a lot on Generative AI, I’m curious about how things work under...

Guillaume Laforge5 min read

0 reactions0 commentsRead full article

Let's make Gemini Groovy!

The happy users of Gemini Advanced, the powerful AI web assistant powered by the Gemini model, can...

Guillaume Laforge5 min read

0 reactions0 commentsRead full article