Flock is a framework for orchestrating LLM-powered agents. It leverages a declarative approach where you simply specify what each agent needs as input and what it produces as outputβwithout having to write lengthy, brittle prompts. Under the hood, Flock transforms these declarations into robust workflows, using cutting-edge components such as Temporal and DSPy to handle fault tolerance, state management, and error recovery.
Traditional Agent Frameworks π | Flock π€π§ππ¦ |
---|---|
π€ Complex Prompt Engineering | π Declarative Agent Definitions |
β’ Lengthy, brittle prompts | β’ Clear, concise input/output declarations |
β’ Hard-to-tune and adapt | β’ No need for manual prompt engineering |
π₯ Fragile Execution | β‘ Robust & Scalable |
β’ Single failure can break the system | β’ Fault-tolerant with built-in retries and error handling |
β’ Difficult to monitor and recover | β’ Automatic recovery via Temporal workflow integration |
ποΈ Rigid Workflows | π Flexible Orchestration |
β’ Limited adaptability | β’ Dynamic agent chaining and hand-offs |
β’ Hard to scale and parallelize | β’ Modular, concurrent, and batch processing |
flock_3min_all.mp4
-
Declarative Agent System:
When you order a pizza at your favorite place, you just tell them what pizza you want, not the 30 steps to get there! And thanks to an invention called LLMs, we now have the technology to come up with those 30 steps.Flock takes advantage of that.
Define agents by declaring their input/output interfaces (with type hints and human-readable descriptions) using a concise syntax.
The framework automatically extracts type and description details, builds precise prompts, and configures the underlying LLM.Testing becomes quite simple as well. You got the pizza you ordered? Passed β
-
Type Safety and Clear Contracts:
Agents are implemented as Pydantic models. This provides automatic JSON serialization/deserialization, strong typing, and an explicit contract for inputs and outputs. Testing, validation, and integration become straightforward. -
Unparalleled Flexibility:
Each agent (via the newFlockAgent
base class) supports lifecycle hooks such asinitialize()
,terminate()
,evaluate()
, andon_error()
. This ensures that agents can perform setup, cleanup, and robust error handlingβall without cluttering the main business logic. Everything is overridable or lets you provide your own callables per callback. We meaneverything
quite literally. Except for the agent name, literally every property of an agent can be set to a callable, leading to highly dynamic and capable agents. -
Fault Tolerance & Temporal Integration:
Flock is built with production readiness in mind. By integrating with Temporal, your agent workflows enjoy automatic retries, durable state management, and resilience against failures. This means that a single agent crash won't bring down your entire system.
Let's showcase easy to understand examples to give you an idea what flock offers! All examples and/or similar examples can be found in the examples folder!
Let's start the most simple way possible π
from flock.core import Flock, FlockAgent
MODEL = "openai/gpt-4o"
flock = Flock(model=MODEL, local_debug=True)
bloggy = FlockAgent(
name="bloggy",
input="blog_idea",
output="funny_blog_title, blog_headers"
)
flock.add_agent(bloggy)
result = flock.run(
start_agent=bloggy,
input={"blog_idea": "A blog about cats"}
)
With almost no boilerplate needed, getting your first agent to run is as easy as cake!
bloggy
takes in a blog_idea
to produce a funny_blog_title
and blog_headers
. That is all!
Flock does take care of the rest, which frees you from needing to write paragraphs of text. You might think abstracting prompting like this means less control - but nope! Quite the contrary, it'll increase your control over it!
When we let bloggy
loose in the flock:
{
'funny_blog_title': '"Whisker Wonders: The Purr-fect Guide to Cat-tastrophes and Feline Follies"',
'blog_headers': (
'1. "The Cat\'s Meow: Understanding Your Feline\'s Language"\n'
'2. "Paws and Reflect: The Secret Life of Cats"\n'
'3. "Fur Real: Debunking Myths About Our Furry Friends"\n'
'4. "Claw-some Adventures: How to Entertain Your Indoor Cat"\n'
'5. "Cat-astrophic Cuteness: Why We Can\'t Resist Those Whiskers"\n'
'6. "Tail Tales: The History of Cats and Their Human Companions"\n'
'7. "Purr-sonality Plus: What Your Cat\'s Behavior Says About Them"\n'
'8. "Kitty Conundrums: Solving Common Cat Problems with Humor"'
),
'blog_idea': 'A blog about cats',
}
Look at that! A real Python object with fields exactly as we defined them in the agent. No need to mess around with parsing or post-processing! π
You probably noticed that your headers aren't a real Python list, but you need one for your downstream task. Flock got you! Just sprinkle some type hints in your agent definition! β¨
from flock.core import Flock, FlockAgent
MODEL = "openai/gpt-4o"
flock = Flock(model=MODEL, local_debug=True)
bloggy = FlockAgent(
name="bloggy",
input="blog_idea",
output="funny_blog_title, blog_headers: list[str]"
)
flock.add_agent(bloggy)
result = flock.run(
start_agent=bloggy,
input={"blog_idea": "A blog about cats"}
)
Et voila! Now you get:
{
'funny_blog_title': '"Whisker Me This: The Purr-fect Guide to Cat-tastic Adventures"',
'blog_headers': [
"The Cat's Out of the Bag: Understanding Feline Behavior",
'Paws and Reflect: The Secret Life of Cats',
'Feline Fine: Health Tips for Your Kitty',
"Cat-astrophic Cuteness: Why We Can't Resist Them",
'Meow-sic to Your Ears: Communicating with Your Cat',
'Claw-some Toys and Games: Keeping Your Cat Entertained',
'The Tail End: Myths and Facts About Cats',
],
'blog_idea': 'A blog about cats',
}
That's not enough for you, since you already got your data classes defined and don't want to redefine them again for some agents?
Also got some hard constraints, like the title needs to be in ALL CAPS? π₯
Check this out:
from pydantic import BaseModel, Field
class BlogSection(BaseModel):
header: str
content: str
class MyBlog(BaseModel):
funny_blog_title: str = Field(description="The funny blog title in all caps")
blog_sections: list[BlogSection]
Since flock is bein' pedantic about pydantic, you can just use your pydantic models like you would use type hints:
bloggy = FlockAgent(
name="bloggy",
input="blog_idea",
output="blog: MyBlog",
)
And BAM! Your finished data model filled up to the brim with data! π
{
'blog': MyBlog(
funny_blog_title='THE PURR-FECT LIFE: CATS AND THEIR QUIRKY ANTICS',
blog_sections=[
BlogSection(
header='Introduction to the Feline World',
content=(
'Cats have been our companions for thousands of years, yet they remain as mysterious and intriguin'
'g as ever. From their graceful movements to their independent nature, cats have a unique charm th'
"at captivates us. In this blog, we'll explore the fascinating world of cats and their quirky anti"
'cs that make them the purr-fect pets.'
),
),
BlogSection(
header='The Mysterious Ways of Cats',
content=(
'Ever wonder why your cat suddenly sprints across the room at 3 AM or stares at a blank wall for h'
'ours? Cats are known for their mysterious behaviors that often leave us scratching our heads. The'
'se antics are not just random; they are deeply rooted in their instincts and natural behaviors. L'
"et's dive into some of the most common and puzzling cat behaviors."
),
),
BlogSection(
header="The Art of Napping: A Cat's Guide",
content=(
"Cats are the ultimate nappers, spending up to 16 hours a day snoozing. But there's more to a catn"
'ap than meets the eye. Cats have perfected the art of napping, and each nap serves a purpose, whe'
"ther it's a quick power nap or a deep sleep. Learn how cats choose their napping spots and the sc"
'ience behind their sleep patterns.'
),
),
BlogSection(
header='The Great Cat Conspiracy: Do They Really Rule the World?',
content=(
"It's a well-known fact among cat owners that cats secretly rule the world. With their ability to "
"manipulate humans into providing endless treats and belly rubs, it's no wonder they have us wrapp"
"ed around their little paws. Explore the humorous side of cat ownership and the 'conspiracy' theo"
'ries that suggest cats are the true overlords of our homes.'
),
),
BlogSection(
header='Conclusion: Why We Love Cats',
content=(
'Despite their quirks and sometimes aloof nature, cats have a special place in our hearts. Their c'
"ompanionship, playful antics, and soothing purrs bring joy and comfort to our lives. Whether you'"
"re a lifelong cat lover or a new cat parent, there's no denying the unique bond we share with our"
" feline friends. So, here's to the purr-fect life with cats!"
),
),
],
),
'blog_idea': 'A blog about cats',
}
Our bloggy
is great, but what if we want to turn those amazing headers into full blog posts? Time to bring in a friend! π€
Let's see how easy it is to make agents work together π
from flock.core import Flock, FlockAgent
flock = Flock(model="openai/gpt-4o")
# First agent: Our trusty bloggy generates titles and headers! π
bloggy = FlockAgent(
name="bloggy",
input="blog_idea: str|The topic to blog about",
output=(
"funny_blog_title: str|A catchy title for the blog, "
"blog_headers: list[str]|List of section headers for the blog"
)
)
# Second agent: The content wizard that brings headers to life! β¨
content_writer = FlockAgent(
name="content_writer",
input=(
"funny_blog_title: str|The blog title to work with, "
"blog_headers: list[str]|The headers to expand into content"
),
output="blog_sections: list[BlogSection]|The fully written blog sections"
)
# Make them besties! π€
bloggy.hand_off = content_writer
# Add your dynamic duo to the flock
flock.add_agent(bloggy)
flock.add_agent(content_writer)
# Let them create some magic! π¨
result = flock.run(
input={"blog_idea": "A blog about cats"},
start_agent=bloggy
)
Super simple rules to remember:
- Point the first agent to the next one using
hand_off
- Make sure their inputs and outputs match up
That's all there is to it! bloggy
comes up with amazing headers, and content_writer
turns them into full blog sections. No more writer's block! π
And this is just the beginning - you can chain as many agents as you want. Maybe add a proofreader? Or an SEO optimizer? But let's not get ahead of ourselves! π
So far we've barely scratched the surface of what flock has to offer, and we're currently hard at work building up the documentation for all the other super cool features Flock has up its sleeve! Stay tuned! π
Flock supports execution on Temporal, ensuring robust, fault-tolerant workflows:
- Durability: Persistent state management even in the case of failures.
- Retries & Error Handling: Automatic recovery via Temporal's built-in mechanisms.
- Scalability: Seamless orchestration of distributed agent workflows.
Documentation in progress!
- Python 3.10+
- (Optional) Temporal server running locally for production-grade workflow features
- API keys for integrated services
recommended services
export OPENAI_API_KEY=sk-proj-
export TAVILY_API_KEY=tvly-
or in .env
For LLM interaction LiteLLM is getting used. Please refer to its documentation on how to easily use other models and/or provider.
https://docs.litellm.ai/docs/providers
pip install flock-core
if you want to use the integrated tools
pip install flock-core[tools]
and for the docling tools
pip install flock-core[all-tools]
-
Clone the Repository:
git clone https://github.com/whiteducksoftware/flock cd flock
-
Create a Virtual Environment and sync all packages:
uv sync --all-groups --all-extras
-
Install local version of flock:
uv build && uv pip install -e .
-
Install Jaeger for telemetry
docker run -d --name jaeger \ -e COLLECTOR_ZIPKIN_HTTP_PORT=9411 \ -p 5775:5775/udp \ -p 6831:6831/udp \ -p 6832:6832/udp \ -p 5778:5778 \ -p 16686:16686 \ -p 14268:14268 \ -p 14250:14250 \ -p 9411:9411 \ jaegertracing/all-in-one:1.41
-
Create your .env
Use
.env_template
as a template for you custom config variables
Contributions are welcome! Please submit Pull Requests and open issues on GitHub.
This project is licensed under the terms of the LICENSE file included in the repository.
- Built with DSPy
- Uses Temporal for workflow management
- Integrates with Tavily for web search capabilities
Flock was created to overcome the limitations of traditional agent frameworks. Key design goals include:
- Simplify Agent Definitions:
Focus on clear input/output contracts rather than long, complex prompts. - Model Agnostic:
Change LLM backends without altering agent logic. - Improved Testability:
Clear, structured interfaces facilitate unit testing and validation.
- Fault Tolerance:
Leveraging Temporal for automatic retries, durable state, and robust error handling. - Scalability:
Support for concurrent, batch, and distributed workflows. - Observability:
Built-in logging and monitoring for real-time insights into workflow execution.
- Expanded type system for richer agent interactions
- Enhanced tool ecosystem and custom integrations
- Advanced monitoring, debugging, and performance metrics
- Extended testing frameworks and validation tools
Join us in building the next generation of reliable, production-ready AI agent systems! Become part of the FLOCK!