Tutorial: Serializing LLM Pipelines


  • Level: Beginner
  • Time to complete: 10 minutes
  • Components Used: HuggingFaceLocalGenerator, PromptBuilder
  • Prerequisites: None
  • Goal: After completing this tutorial, you’ll understand how to serialize and deserialize between YAML and Python code.

Overview

📚 Useful Documentation: Serialization

Serialization means converting a pipeline to a format that you can save on your disk and load later. It’s especially useful because a serialized pipeline can be saved on disk or a database, get sent over a network and more.

Although it’s possible to serialize into other formats too, Haystack supports YAML our of the box to make it easy for humans to make changes without the need to go back and forth with Python code. In this tutorial, we will create a very simple pipeline in Python code, serialize it into YAML, make changes to it, and deserialize it back into a Haystack Pipeline.

Preparing the Colab Environment

Installing Haystack

Install Haystack 2.0 Beta with pip:

%%bash

pip install haystack-ai

Enabling Telemetry

Knowing you’re using this tutorial helps us decide where to invest our efforts to build a better product but you can always opt out by commenting the following line. See Telemetry for more details.

from haystack.telemetry import tutorial_running

tutorial_running(29)

Creating a Simple Pipeline

First, let’s create a very simple pipeline that expects a topic from the user, and generates a summary about the topic with google/flan-t5-large. Feel free to modify the pipeline as you wish. Note that in this pipeline we are using a local model that we’re getting from Hugging Face. We’re using a relatively small, open-source LLM.

from haystack import Pipeline
from haystack.components.builders import PromptBuilder
from haystack.components.generators import HuggingFaceLocalGenerator

template = """
Please create a summary about the following topic:
{{ topic }}
"""
builder = PromptBuilder(template=template)
llm = HuggingFaceLocalGenerator(model="google/flan-t5-large",
                                      task="text2text-generation",
                                      generation_kwargs={
                                        "max_new_tokens": 150,
                                        })

pipeline = Pipeline()
pipeline.add_component(name="builder", instance=builder)
pipeline.add_component(name="llm", instance=llm)

pipeline.connect("builder", "llm")
topic = "Climate change"
result = pipeline.run(data={"builder": {"topic": topic}})
print(result['llm']['replies'][0])

Serialize the Pipeline to YAML

Out of the box, Haystack supports YAML. Use dumps() to convert the pipeline to YAML:

yaml_pipeline = pipeline.dumps()

print(yaml_pipeline)

You should get a pipeline YAML that looks like the following:

components:
  builder:
    init_parameters:
      template: "\nPlease create a summary about the following topic: \n{{ topic }}\n"
    type: haystack.components.builders.prompt_builder.PromptBuilder
  llm:
    init_parameters:
      generation_kwargs:
        max_new_tokens: 150
      huggingface_pipeline_kwargs:
        device: cpu
        model: google/flan-t5-large
        task: text2text-generation
        token: null
      stop_words: null
    type: haystack.components.generators.hugging_face_local.HuggingFaceLocalGenerator
connections:
- receiver: llm.prompt
  sender: builder.prompt
max_loops_allowed: 100
metadata: {}

Editing a Pipeline in YAML

Let’s see how we can make changes to serialized pipelines. For example, below, let’s modify the promptbuilder’s template to translate provided sentence to French:

yaml_pipeline = """
components:
  builder:
    init_parameters:
      template: "\nPlease translate the following to French: \n{{ sentence }}\n"
    type: haystack.components.builders.prompt_builder.PromptBuilder
  llm:
    init_parameters:
      generation_kwargs:
        max_new_tokens: 150
      huggingface_pipeline_kwargs:
        device: cpu
        model: google/flan-t5-large
        task: text2text-generation
        token: null
      stop_words: null
    type: haystack.components.generators.hugging_face_local.HuggingFaceLocalGenerator
connections:
- receiver: llm.prompt
  sender: builder.prompt
max_loops_allowed: 100
metadata: {}
"""

Deseriazling a YAML Pipeline back to Python

You can deserialize a pipeline by calling loads(). Below, we’re deserializing our edited yaml_pipeline:

from haystack import Pipeline
from haystack.components.builders import PromptBuilder
from haystack.components.generators import HuggingFaceLocalGenerator

new_pipeline = Pipeline.loads(yaml_pipeline)

Now we can run the new pipeline we defined in YAML. We had changed it so that the PromptBuilder expects a sentence and translates the sentence to French:

new_pipeline.run(data={"builder": {"sentence": "I love capybaras"}})

What’s next

🎉 Congratulations! You’ve serialzed a pipeline into YAML, edited it and ran it again!

If you liked this tutorial, you may also enjoy:

To stay up to date on the latest Haystack developments, you can sign up for our newsletter. Thanks for reading!