Class: LlamaDeuce
Llama2 LLM implementation
Implements
Constructors
constructor
• new LlamaDeuce(init?
)
Parameters
Name | Type |
---|---|
init? | Partial <LlamaDeuce > |
Defined in
packages/core/src/llm/LLM.ts:436
Properties
chatStrategy
• chatStrategy: DeuceChatStrategy
Defined in
packages/core/src/llm/LLM.ts:429
hasStreaming
• hasStreaming: boolean
Implementation of
Defined in
packages/core/src/llm/LLM.ts:434
maxTokens
• Optional
maxTokens: number
Defined in
packages/core/src/llm/LLM.ts:432
model
• model: "Llama-2-70b-chat-old"
| "Llama-2-70b-chat-4bit"
| "Llama-2-13b-chat-old"
| "Llama-2-13b-chat-4bit"
| "Llama-2-7b-chat-old"
| "Llama-2-7b-chat-4bit"
Defined in
packages/core/src/llm/LLM.ts:428
replicateSession
• replicateSession: ReplicateSession
Defined in
packages/core/src/llm/LLM.ts:433
temperature
• temperature: number
Defined in
packages/core/src/llm/LLM.ts:430
topP
• topP: number
Defined in
packages/core/src/llm/LLM.ts:431
Accessors
metadata
• get
metadata(): Object
Returns
Object
Name | Type |
---|---|
contextWindow | number |
maxTokens | undefined | number |
model | "Llama-2-70b-chat-old" | "Llama-2-70b-chat-4bit" | "Llama-2-13b-chat-old" | "Llama-2-13b-chat-4bit" | "Llama-2-7b-chat-old" | "Llama-2-7b-chat-4bit" |
temperature | number |
tokenizer | undefined |
topP | number |
Implementation of
Defined in
packages/core/src/llm/LLM.ts:456
Methods
chat
▸ chat<T
, R
>(messages
, _parentEvent?
, streaming?
): Promise
<R
>
Get a chat response from the LLM
Type parameters
Name | Type |
---|---|
T | extends undefined | boolean = undefined |
R | T extends true ? AsyncGenerator <string , void , unknown > : ChatResponse |
Parameters
Name | Type | Description |
---|---|---|
messages | ChatMessage [] | The return type of chat() and complete() are set by the "streaming" parameter being set to True. |
_parentEvent? | Event | - |
streaming? | T | - |
Returns
Promise
<R
>
Implementation of
Defined in
packages/core/src/llm/LLM.ts:594
complete
▸ complete<T
, R
>(prompt
, parentEvent?
, streaming?
): Promise
<R
>
Get a prompt completion from the LLM
Type parameters
Name | Type |
---|---|
T | extends undefined | boolean = undefined |
R | T extends true ? AsyncGenerator <string , void , unknown > : ChatResponse |
Parameters
Name | Type | Description |
---|---|---|
prompt | string | the prompt to complete |
parentEvent? | Event | - |
streaming? | T | - |
Returns
Promise
<R
>
Implementation of
Defined in
packages/core/src/llm/LLM.ts:634
mapMessageTypeA16Z
▸ mapMessageTypeA16Z(messageType
): string
Parameters
Name | Type |
---|---|
messageType | MessageType |
Returns
string
Defined in
packages/core/src/llm/LLM.ts:503
mapMessagesToPrompt
▸ mapMessagesToPrompt(messages
): Object
Parameters
Name | Type |
---|---|
messages | ChatMessage [] |
Returns
Object
Name | Type |
---|---|
prompt | string |
systemPrompt | any |
Defined in
packages/core/src/llm/LLM.ts:467
mapMessagesToPromptA16Z
▸ mapMessagesToPromptA16Z(messages
): Object
Parameters
Name | Type |
---|---|
messages | ChatMessage [] |
Returns
Object
Name | Type |
---|---|
prompt | string |
systemPrompt | undefined |
Defined in
packages/core/src/llm/LLM.ts:489
mapMessagesToPromptMeta
▸ mapMessagesToPromptMeta(messages
, opts?
): Object
Parameters
Name | Type |
---|---|
messages | ChatMessage [] |
opts? | Object |
opts.replicate4Bit? | boolean |
opts.withBos? | boolean |
opts.withNewlines? | boolean |
Returns
Object
Name | Type |
---|---|
prompt | string |
systemPrompt | any |
Defined in
packages/core/src/llm/LLM.ts:516
tokens
▸ tokens(messages
): number
Calculates the number of tokens needed for the given chat messages
Parameters
Name | Type |
---|---|
messages | ChatMessage [] |
Returns
number