As described in the Enforcing JSON Schema cookbook, LLMs are now good at generating outputs that follow a specified syntax. We can combine this LLM ability with their reasoning ability to let LLMs interact with external APIs. This is called Function (or Tool) calling. In simple terms, function calling:

  1. Informs the user when a question can be answered using an external API
  2. Generates a valid request in the API’s format
  3. Converts the API’s response to a natural language answer

Function calling is currently supported on select models on Anyscale, Together AI, Fireworks AI, Google Gemini, and OpenAI. Using Portkey, you can easily experiment with function calling across various providers and gain confidence to ship it to production.

Let’s understand how it works with an example:

We want the LLM to tell what’s the temperature in Delhi today. We’ll use a “Weather API” to fetch the weather:

Node

import Portkey from "portkey-ai";

const portkey = new Portkey({
  apiKey: "PORTKEY_API_KEY",
  virtualKey: "ANYSCALE_VIRTUAL_KEY",
});

// Describing what the Weather API does and expects

let tools = [

    {
        "type": "function",
        "function": {
            "name": "getWeather",
            "description": "Get the current weather in a given location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city and state",
                    },
                    "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
                },
                "required": ["location"],
            },
        },
    }
];

let response = await portkey.chat.completions.create({

    model: "mistralai/Mixtral-8x7B-Instruct-v0.1",
    messages: [
        {"role": "system", "content": "You are helpful assistant."},
        {"role": "user", "content": "What's the weather like in Delhi - respond in JSON"}
    ],
    tools,
    tool_choice: "auto", // auto is default, yet explicit

});

console.log(response.choices[0].finish_reason)

Here, we’ve defined what the Weather API expects for its requests in the tool param, and set tool_choice to auto. So, based on the user messages, the LLM will decide if it should do a function call to fulfill the request. Here, it will choose to do that, and we’ll see the following output:

{

    "role": "assistant",
    "content": null,
    "tool_calls": [
        "id": "call_x8we3xx",
        "type": "function",
        "function": {
            "name": "getWeather",
            "arguments": '{\n  "location": "Delhi, India",\n  "format": "celsius"\n}'
        }
    ],
}

We can just take the tool_call made by the LLM, and pass it to our getWeather function - it should return a proper response to our query. We then take that response and send it to our LLM to complete the loop:

/**

* getWeather(..) is a utility to call external weather service APIs

* Responds with: {"temperature": 20, "unit": "celsius"}

**/

let weatherData = await getWeather(JSON.parse(arguments));

let content = JSON.stringify(weatherData);

// Push assistant and tool message from earlier generated function arguments

messages.push(assistantMessage); //

messages.push({

    role: "tool",
    content: content,
    toolCallId: "call_x8we3xx"
    name: "getWeather"

});

let response = await portkey.chat.completions.create({

  model: "mistralai/Mixtral-8x7B-Instruct-v0.1",
  tools:tools,
  messages:messages,
  tool_choice: "auto",

});

We should see this final output:

{
    "role": "assistant",
    "content": "It's 30 degrees celsius in Delhi, India.",

}

Function Calling Workflow

Recapping, there are 4 key steps to doing function calling, as illustrated below:

Function Calling Workflow

Function Calling Workflow

Supporting Models

While most providers have standard function calling as illustrated above, models on Together AI & select new models on OpenAI (gpt-4-turbo-preview, gpt-4-0125-preview, gpt-4-1106-preview, gpt-3.5-turbo-0125, and gpt-3.5-turbo-1106) also support parallel function calling - here, you can pass multiple requests in a single query, the model will pick the relevant tool for each query, and return an array of tool_calls each with a unique ID. (Read here for more info)

Model/ProviderStandard Function CallingParallel Function Calling
mistralai/Mistral-7B-Instruct-v0.1 Anyscale

mistralai/Mixtral-8x7B-Instruct-v0.1Anyscale

mistralai/Mixtral-8x7B-Instruct-v0.1Together AI

mistralai/Mistral-7B-Instruct-v0.1Together AI

togethercomputer/CodeLlama-34b-InstructTogether AI

gpt-4 and previous releases OpenAI / Azure OpenAI

(some)
gpt-3.5-turbo and previous releases OpenAI / Azure OpenAI

(some)
firefunction-v1Fireworks

fw-function-call-34b-v0Fireworks

gemini-1.0-progemini-1.0-pro-001gemini-1.5-pro-lates Google