Models
Model Guides
The models available on our platform are listed below in detail.
Model | Company | Modality | Type | License
---|---|---|---|---
gpt-35-turbo-16k | OpenAI | text | generative | Proprietary
gpt-4 | OpenAI | multimodal | generative | Proprietary
gpt-4o | OpenAI | multimodal | generative | Proprietary
amazon.titan-tg1-large | Amazon Bedrock | text | generative | Proprietary
gemini-pro | Google VertexAI | text | generative | Proprietary
gemini-1.5-pro | Google VertexAI | multimodal | generative | Proprietary
gemini-1.5-flash | Google VertexAI | multimodal | generative | Proprietary
jais-30b-chat | Cerebras Systems | text | generative | Open-Source
text-embedding-ada-002 | OpenAI | text | embedding | Proprietary
textembedding-gecko | Google VertexAI | text | embedding | Proprietary
dalle | OpenAI | multimodal | generative | Proprietary
stable-diffusion | Stability AI | multimodal | generative | Open-Source
Model Description
gpt-35-turbo-16k
This OpenAI model is pretrained on a large dataset. GPT-3.5 Turbo with a 16k-token context window is an extended version of GPT-3.5 designed to handle larger inputs and provide more coherent and comprehensive outputs. It’s suitable for more complex conversations and tasks involving substantial amounts of text. The max_tokens is 16384 and max_output_token is 4096.
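The sketch below shows what a chat completion against this model might look like with the OpenAI Python SDK; the model identifier and prompt are placeholders (gpt-35-turbo-16k is an Azure-style deployment name, while the OpenAI API uses gpt-3.5-turbo-16k), so adjust them to your deployment.

```python
# Minimal sketch using the openai Python SDK (v1.x); identifiers are illustrative.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo-16k",  # or your deployment name, e.g. gpt-35-turbo-16k
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize this meeting transcript in three bullet points: ..."},
    ],
    max_tokens=4096,  # must stay within the model's max_output_token limit
)
print(response.choices[0].message.content)
```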
gpt-4
GPT-4 with a 128k-token context length provides a significant advantage in handling extremely large documents or conversations. This model can maintain long-form continuity and contextual accuracy over extended interactions. The max_tokens is 128000 and max_output_token is 4096. The input and output type can be text or image.
gpt-4o
This version of GPT-4, optimized for even larger contexts, supports up to 128k tokens, making it well suited to tasks involving large amounts of data or long-term context maintenance. It is an ideal choice for intricate applications that require substantial context retention. The max_tokens is 128000 and max_output_token is 4096. The input and output type can be text or image.
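As a rough illustration of the multimodal input, the following sketch (OpenAI Python SDK; the image URL is a placeholder) passes an image alongside a text question:

```python
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what this chart shows."},
                {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},  # placeholder URL
            ],
        }
    ],
    max_tokens=1024,
)
print(response.choices[0].message.content)
```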
amazon.titan-tg1-large
Titan-TG1-Large is one of Amazon’s large language models, available via AWS. It is designed for various business and conversational applications, with solid performance for handling relatively large amounts of text but with a more limited context length compared to other models. The max_tokens is 4096 and max_output_token is 2048.
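A minimal sketch of invoking this model through boto3’s bedrock-runtime client is shown below; the request body assumes the Titan text-generation format, and the region and prompt are placeholders.

```python
import json
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")  # placeholder region

body = json.dumps({
    "inputText": "Draft a short product announcement for our new analytics dashboard.",
    "textGenerationConfig": {"maxTokenCount": 2048, "temperature": 0.7},  # 2048 = max_output_token above
})

response = client.invoke_model(modelId="amazon.titan-tg1-large", body=body)
result = json.loads(response["body"].read())
print(result["results"][0]["outputText"])
```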
gemini-pro
Gemini Pro is part of Google’s Gemini family of models, aimed at providing high-quality natural language processing with a focus on productivity and creative tasks. It’s optimized for handling long inputs, such as articles or reports. The max_tokens is 32760 and max_output_token is 8192.
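A text-only call through the Vertex AI Python SDK might look like the sketch below; the project ID and location are placeholders.

```python
import vertexai
from vertexai.generative_models import GenerationConfig, GenerativeModel

vertexai.init(project="my-gcp-project", location="us-central1")  # placeholders

model = GenerativeModel("gemini-pro")
response = model.generate_content(
    "Summarize the key findings of this quarterly report: ...",
    generation_config=GenerationConfig(max_output_tokens=8192),  # matches max_output_token above
)
print(response.text)
```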
gemini-1.5-pro
Gemini 1.5 Pro is an upgraded version of Google’s Gemini series, offering improved performance for professional and creative applications. It supports advanced language understanding with a focus on generating high-quality text for various industries. The max_tokens is 2097152 and max_output_token is 8192. The input type can be text, code, images, audio, video, video with audio, or PDF, whereas the output type is text only.
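Because the model accepts documents and media as input, a sketch like the following (Vertex AI Python SDK; the project and bucket path are placeholders) can pass a PDF from Cloud Storage together with a text instruction:

```python
import vertexai
from vertexai.generative_models import GenerativeModel, Part

vertexai.init(project="my-gcp-project", location="us-central1")  # placeholders

model = GenerativeModel("gemini-1.5-pro")

# The large context window (~2M tokens) leaves room for very long documents.
pdf = Part.from_uri("gs://my-bucket/annual-report.pdf", mime_type="application/pdf")  # placeholder URI
response = model.generate_content([pdf, "List the main risk factors mentioned in this report."])
print(response.text)
```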
gemini-1.5-flash
Gemini 1.5 Flash is a lighter version of the Gemini 1.5 Pro model, optimized for faster responses and efficient handling of shorter inputs. It is ideal for applications that require quick, real-time interactions with less need for long-term context. The max_tokens is 1048576 and max_output_token is 8192. The input type can be text, code, images, audio, video, video with audio, or PDF, whereas the output type is text only.
jais-30b-chat
Jais 30B Chat is a large language model with 30 billion parameters, designed for conversational AI applications. It has a medium context window suited for various chatbot and interactive applications. The max_tokens is 8192 and max_output_token is 2048.
text-embedding-ada-002
text-embedding-ada-002 is an embedding model developed by OpenAI, designed to convert text into high-dimensional vector representations. These vectors are useful for tasks like semantic search, clustering, classification, and similarity comparisons. It’s an advanced version of the Ada embedding series, with a focus on performance and accuracy in embedding generation. It processes textual input and outputs dense vector representations that can capture the semantic meaning of the input text. The dimension size is 1536.
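A minimal embedding call with the OpenAI Python SDK is sketched below; the returned vector should have the 1536 dimensions noted above.

```python
from openai import OpenAI

client = OpenAI()

response = client.embeddings.create(
    model="text-embedding-ada-002",
    input="Vector stores enable semantic search over documents.",
)
vector = response.data[0].embedding
print(len(vector))  # 1536
```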
textembedding-gecko
textembedding-gecko is a specific version of a pre-trained text embedding model offered by Google Cloud Platform (GCP) through Vertex AI and their suite of AI tools and services. The dimension size is 768.
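The equivalent call through the Vertex AI Python SDK might look like this sketch (project and location are placeholders); each returned embedding has the 768 dimensions noted above.

```python
import vertexai
from vertexai.language_models import TextEmbeddingModel

vertexai.init(project="my-gcp-project", location="us-central1")  # placeholders

model = TextEmbeddingModel.from_pretrained("textembedding-gecko")
embeddings = model.get_embeddings(["Vector stores enable semantic search over documents."])
print(len(embeddings[0].values))  # 768
```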
dalle
DALLE is a powerful and creative text-to-image model from OpenAI, known for generating imaginative and realistic images from detailed text prompts. It’s primarily closed-source and available through OpenAI’s API. The input and output type can be images or text.
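An image-generation request via the OpenAI Python SDK might look like the sketch below; the specific model variant (for example dall-e-3), prompt, and size are assumptions rather than platform defaults.

```python
from openai import OpenAI

client = OpenAI()

response = client.images.generate(
    model="dall-e-3",  # assumed variant; check which DALL·E version the platform exposes
    prompt="A watercolor illustration of a lighthouse at dawn",
    size="1024x1024",
    n=1,
)
print(response.data[0].url)  # URL of the generated image
```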
stable-diffusion
Stable Diffusion, on the other hand, is an open-source text-to-image model that uses a latent diffusion approach to generate high-quality images. It provides flexibility and customization options and has gained traction for being open-source, allowing widespread usage, fine-tuning, and integration in various applications. The input and output type can be images or text.
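Because the model is open-source, it can also be run locally, for example with Hugging Face diffusers; the checkpoint name below is an illustrative weight repository, not necessarily the one used by the platform.

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # illustrative checkpoint
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")  # requires a GPU; use "cpu" (and float32) otherwise

image = pipe("A watercolor illustration of a lighthouse at dawn").images[0]
image.save("lighthouse.png")
```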