

Building a Simple LLM Agent Workflow with LangGraph

agenticworkflow
Author

Peyman Kor

Published

January 28, 2025

Introduction to AI Agent

These days there is a lot of discussion about AI agents, and the topic keeps getting hotter. Essentially, an AI agent is an improvement over a standalone LLM: it adds 1) memory, 2) tools, and 3) planning capacity, allowing us to get more out of the base model.

Here’s my initial idea:

  • The memory can be short-term (for example, the dialog history) or long-term
  • The tools can be API calls, so the LLM can access specific information, for example a web search API to retrieve additional, up-to-date information
  • The planning capacity is how the LLM can break a request into multiple tasks and execute them

In this short blog, I'm first going to show what LangGraph is: a well-established, structured Python framework for building LLM agents. Then, what we want to do is very simple:

  1. We start with one question (a query)
  2. That query goes to the LLM's tools (Wikipedia and web search)
  3. Both tools generate results
  4. We combine those results
  5. We give that combined context to the LLM so it can reply with the context in hand

Essentially, we are increasing the power of the LLM with additional tools. It's like a chef in the kitchen: if you give them a food order, they can cook, but if you give them more tools, like a stove or other equipment, they will be much more capable. That's the whole idea of an agent, and we will keep building on it.

Setting up the API keys

First, we need to provide the OpenAI API key to use their LLM services. In my case, I store the API key in a .env file for better security and load it from there. Alternatively, you can directly set the API key in the code, which I’ve shown as Method 2 below. Both approaches work, but using a .env file is generally considered better practice since it keeps sensitive credentials separate from the code.

# Method 1
import os
from dotenv import load_dotenv

load_dotenv(override=True)

openai_api_key = os.getenv("OPENAI_API_KEY")

# Method 2
#import os
#os.environ["OPENAI_API_KEY"] = "your-api-key"
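As an optional sanity check (my own addition, not something the libraries require), you can fail early if the key did not load:

# Optional: stop early if the key is missing
if not openai_api_key:
    raise ValueError("OPENAI_API_KEY not found - check your .env file")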

Report with Standalone LLM

Now let's get started. First, we import ChatOpenAI from langchain_openai. I'm using GPT-4o-mini since it's the most cost-effective option. My test query is "Briefly explain me DeepSeek and how it affected NVIDIA Share price" - I purposely selected this query to benchmark how the LLM agent improves the response compared to using the LLM alone; it serves as our evaluation baseline. The code uses a simple system message to set up the assistant role, and I save the report from the LLM in a markdown file for better readability.

from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

query = "Briefly explain me DeepSeek and how it affected NVIDIA Share price"

messages = [
    (
        "system",
        "You are a helpful assistant that give me short report about the user question",
    ),
    ("human", query),
]
answer = llm.invoke(messages)

# Save answer content to markdown file
with open("deepseek_report_llm.md", "w") as f:
    f.write("# Report using LLM\n\n")
    f.write(answer.content)

(Screenshot: the report generated by the standalone LLM, saved to deepseek_report_llm.md)

Agentic Workflow

Now we'll enhance our LLM with additional tools to produce a more informed report (the previous report was largely hallucinated and not satisfactory). We'll use two main tools:

  1. Web search using Tavily
  2. Wikipedia search

The workflow is implemented using a State class that tracks:

  • question: The user's original query
  • context: Combined results from the web and Wikipedia searches (a list whose entries are added together)
  • answer: The final response generated by the LLM

The State class is a TypedDict that enforces these three fields.


from typing import Annotated
from typing_extensions import TypedDict
import operator

class State(TypedDict):
    question: str
    answer: str
    context: Annotated[list, operator.add]


# You need to get the tavily api key from the tavily website and put it in the .env file
# https://app.tavily.com/
tavily_api_key = os.getenv("TAVILY_API_KEY")
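A quick illustration of why context is annotated with operator.add: when two nodes run in parallel and each returns its own partial update for context, LangGraph uses that reducer to merge the lists instead of overwriting one with the other. Here is a minimal sketch with made-up placeholder strings:

import operator

# Hypothetical partial updates, as the two tool nodes would return them
web_update = {"context": ["<web search results>"]}
wiki_update = {"context": ["<wikipedia results>"]}

# The reducer concatenates the lists, so both entries end up in the state
merged = operator.add(web_update["context"], wiki_update["context"])
print(merged)  # ['<web search results>', '<wikipedia results>']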

Define the Tools

Here we have two search functions. In both functions, we take a question from the state (which we defined in the previous section) and pass it to either web search or Wikipedia search. These tools provide additional context to the LLM so it can generate a better informed report.

from langchain_core.messages import HumanMessage, SystemMessage
from langchain_community.document_loaders import WikipediaLoader
from langchain_community.tools import TavilySearchResults


def search_web(state):
    
    """ Retrieve docs from web search """

    # Search
    tavily_search = TavilySearchResults(max_results=5)
    search_docs = tavily_search.invoke(state['question'])

     # Format
    formatted_search_docs = "\n\n---\n\n".join(
        [
            f'<Document href="{doc["url"]}"/>\n{doc["content"]}\n</Document>'
            for doc in search_docs
        ]
    )

    return {"context": [formatted_search_docs]} 


def search_wikipedia(state):
    
    """ Retrieve docs from wikipedia """

    # Search
    search_docs = WikipediaLoader(query=state['question'], 
                                  load_max_docs=2).load()

     # Format
    formatted_search_docs = "\n\n---\n\n".join(
        [
            f'<Document source="{doc.metadata["source"]}" page="{doc.metadata.get("page", "")}"/>\n{doc.page_content}\n</Document>'
            for doc in search_docs
        ]
    )

    return {"context": [formatted_search_docs]} 

Generate the answer

Now the generate_answer function takes the state, which at this point holds both the question and the context returned by the two tool functions, and asks the LLM to answer the original question (about DeepSeek and NVIDIA) using that context. This is how we bring the extra context into the workflow.

def generate_answer(state):
    
    """ Node to answer a question """

    # Get state
    context = state["context"]
    question = state["question"]

    # Template
    answer_template = """Answer the question {question} using this context: {context}"""
    answer_instructions = answer_template.format(question=question, 
                                                       context=context)    
    
    # Answer
    
    answer = llm.invoke([SystemMessage(content=answer_instructions),
                         HumanMessage(content="Answer the question.")])
      
    # Append it to state
    return {"answer": answer, "context": context}

Building the graph

Now we build the graph. We start with the state, then add the nodes: search_web, search_wikipedia, and generate_answer. We then add the edges (simply put, how the nodes are connected) and compile the graph.

Note that the final graph shows five nodes, but the first (START) and the last (END) are default nodes that LangGraph adds. The three in between are ours: the two tool nodes, plus the generate_answer node that gathers their results and produces the answer.

from langgraph.graph import StateGraph, START, END
builder = StateGraph(State)


# Add the three nodes defined above
builder.add_node("search_web", search_web)
builder.add_node("search_wikipedia", search_wikipedia)
builder.add_node("generate_answer", generate_answer)

# Add the edges (the flow between nodes)
builder.add_edge(START, "search_wikipedia")
builder.add_edge(START, "search_web")
builder.add_edge("search_wikipedia", "generate_answer")
builder.add_edge("search_web", "generate_answer")
builder.add_edge("generate_answer", END)

from IPython.display import Image, display
graph = builder.compile()

display(Image(graph.get_graph().draw_mermaid_png()))
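If rendering the Mermaid PNG is inconvenient in your environment, the same graph object can also be printed as ASCII; as far as I recall this needs the grandalf package installed:

# Alternative: text rendering of the compiled graph (requires grandalf)
graph.get_graph().print_ascii()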

Report with Agent

Now as you can see, we are asking the same question again about DeepSeek and how it affected the NVIDIA share price. This time we are invoking the graph that we defined earlier, which is a combination of all the edges and nodes that we put together. After compiling the graph in the previous step, it is now ready to function as a workflow. As before, I save the final report as a markdown file for easier readability and comparison.

result = graph.invoke({"question": 
                       "Briefly explain me DeepSeek and how it affected NVIDIA Share price"})


# Save the answer to a markdown file
with open('deepseek_report_agent.md', 'w') as f:
    f.write("# Report using Agent\n\n")
    f.write(result['answer'].content)
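The dict returned by graph.invoke is the final state, so besides the answer it also carries the raw context gathered by the two tools. If you want to peek at what the tools actually returned, a quick optional check:

print(result.keys())               # question, answer and context
print(result["context"][0][:300])  # first chunk of the combined tool output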

(Screenshot: the report generated by the agent workflow, saved to deepseek_report_agent.md)

Conclusion

In summary, in this blog, we explored the concept of AI agents using LangGraph, a nice Python framework for building LLM agents. We started with a simple LLM workflow and then enhanced it with tools (Wikipedia and web search) to provide more context. We built a graph to orchestrate the workflow and then used it to answer a question about DeepSeek and NVIDIA share price.

If we compare the report from the standalone LLM (no tools) with the report from the LLM that had access to the tools, the first report was clearly unsatisfactory: it was essentially hallucination. But once you add recent reporting about NVIDIA, its share price, and DeepSeek from both the web and Wikipedia, the report becomes far more grounded. This is a typical example of how you can start with a small, weak (and cheap!) model like GPT-4o-mini, and with the proper context and proper tools it just gets better and better.

To draw an analogy with humans: we humans are agents too :) The more tools you have, the more capable you are. In this example, the tools made the base LLM far more capable of providing the answer we wanted.