Cookbook
Cookbook \ Instructor \ Basics
- Basic use
- Specifying required and optional parameters via constructor
- Getters and setters
- Private vs public object field
- Basic use via mixin
- Fluent API
- Handling errors with `Maybe` helper class
- Mixed Type Property
- Modes
- Making some fields optional
- Automatic correction based on validation results
- Using attributes
- Using LLM API connection presets from config file
- Validation
- Custom validation using Symfony Validator
- Validation across multiple fields
- Validation with LLM
Cookbook \ Instructor \ Advanced
- Use custom configuration providers
- Context caching (structured output)
- Customize parameters of LLM driver
- Custom prompts
- Customize parameters via DSN
- Extracting arguments of function or method
- Logging monolog
- Logging psr
- Streaming partial updates during inference
- Providing example inputs and outputs
- Extracting scalar values
- Extracting sequences of objects
- Streaming
- Structures
Cookbook \ Instructor \ Troubleshooting
Cookbook \ Instructor \ LLM API Support
Cookbook \ Instructor \ Extras
- Extraction of complex objects
- Extraction of complex objects (Anthropic)
- Extraction of complex objects (Cohere)
- Extraction of complex objects (Gemini)
- Using structured data as an input
- Image processing - car damage detection
- Image to data (OpenAI)
- Image to data (Anthropic)
- Image to data (Gemini)
- Generating JSON Schema from PHP classes
- Generating JSON Schema from PHP classes
- Generating JSON Schema dynamically
- Create tasks from meeting transcription
- Translating UI text fields
- Web page to PHP objects
Cookbook \ Polyglot \ LLM Basics
- Working directly with LLMs
- Working directly with LLMs and JSON - JSON mode
- Working directly with LLMs and JSON - JSON Schema mode
- Working directly with LLMs and JSON - MdJSON mode
- Working directly with LLMs and JSON - Tools mode
- Generating JSON Schema from PHP classes
- Generating JSON Schema from PHP classes
Cookbook \ Polyglot \ LLM Advanced
Cookbook \ Polyglot \ LLM Troubleshooting
Cookbook \ Polyglot \ LLM API Support
Cookbook \ Polyglot \ LLM Extras
Cookbook \ Prompting \ Zero-Shot Prompting
Cookbook \ Prompting \ Few-Shot Prompting
Cookbook \ Prompting \ Thought Generation
Cookbook \ Prompting \ Miscellaneous
- Arbitrary properties
- Consistent values of arbitrary properties
- Chain of Summaries
- Chain of Thought
- Single label classification
- Multiclass classification
- Entity relationship extraction
- Handling errors
- Limiting the length of lists
- Reflection Prompting
- Restating instructions
- Ask LLM to rewrite instructions
- Expanding search queries
- Summary with Keywords
- Reusing components
- Using CoT to improve interpretation of component data
Cookbook \ Polyglot \ LLM Advanced
Embeddings
Overview
Embeddings
class offers access to embeddings APIs which allows to generate
vector representations of inputs. These embeddings can be used to compare
semantic similarity between inputs, e.g. to find relevant documents based
on a query.
Embeddings
class supports following embeddings providers:
- Azure
- Cohere
- Gemini
- Jina
- Mistral
- OpenAI
Embeddings providers access details can be found and modified via
/config/embed.php
.
To store and search across large sets of vector embeddings you may want to use one of the popular vector databases: PGVector, Chroma, Pinecone, Weaviate, Milvus, etc.
Example
Copy
<?php
require 'examples/boot.php';
use Cognesy\Polyglot\Embeddings\Embeddings;
use Cognesy\Polyglot\Embeddings\Utils\EmbedUtils;
$query = "technology news";
$documents = [
'Computer vision models are used to analyze images and videos.',
'The bakers at the Nashville Bakery baked 200 loaves of bread on Monday morning.',
'The new movie starring Tom Hanks is now playing in theaters.',
'Famous soccer player Lionel Messi has arrived in town.',
'News about the latest iPhone model has been leaked.',
'New car model by Tesla is now available for pre-order.',
'Philip K. Dick is an author of many sci-fi novels.',
];
$inputs = array_merge([$query], $documents);
$topK = 3;
// generate embeddings for query and documents (in a single request)
$response = (new Embeddings)
->using('openai')
->withInputs($inputs)
->get();
// get query and doc vectors from the response
[$queryVectors, $docVectors] = $response->split(1);
$queryVector = $queryVectors[0]
?? throw new \InvalidArgumentException('Query vector not found');
// calculate cosine similarities
$similarities = EmbedUtils::findTopK($queryVector, $docVectors, $topK);
// print documents most similar to the query
echo "Query: " . $query . PHP_EOL;
$count = 1;
foreach($similarities as $index => $similarity) {
echo $count++;
echo ': ' . $documents[$index];
echo ' - cosine similarity to query = ' . $similarities[$index];
echo PHP_EOL;
}
assert(!empty($similarities));
?>
Assistant
Responses are generated using AI and may contain mistakes.