Table of Contents
AutoComplete
Hack Summary | |
---|---|
Name | AutoComplete |
Screenshot | |
Category | chat |
In-game description |
“Auto-completes your chat messages using large language models. Requires an OpenAI account with API access or any other language model API that is OpenAI-compatible.” |
Default keybind | none |
Source code | https://github.com/Wurst-Imperium/Wurst7/blob/master/src/main/java/net/wurstclient/hacks/AutoCompleteHack.java, https://github.com/Wurst-Imperium/Wurst7/blob/master/src/main/java/net/wurstclient/hacks/autocomplete |
AutoComplete is a Minecraft hack that generates auto-completions for the user's chat messages, using large language models like GPT-3, GPT-4 and LLaMA.
Settings
OpenAI model
OpenAI model | |
---|---|
Type | Enum |
In-game description | “The model to use for OpenAI API calls. GPT-4o-2024-08-06 is one of the smartest models at the time of writing and will often produce the best completions. However, it's meant to be an assistant rather than an auto-completion system, so you will see it produce some odd completions at times. GPT-3.5-Turbo-Instruct is an older, non-chat model based on GPT-3.5 that works well for auto-completion tasks.” |
Default value | gpt-4o-2024-08-06 |
Possible values | gpt-4o-2024-08-06, gpt-4o-2024-05-13, gpt-4o-mini-2024-07-18, gpt-4-turbo-2024-04-09, gpt-4-0125-preview, gpt-4-1106-preview, gpt-4-0613, gpt-3.5-turbo-0125, gpt-3.5-turbo-1106, gpt-3.5-turbo-instruct, davinci-002, babbage-002 |
Max tokens
Max tokens | |
---|---|
Type | Slider |
In-game description | “The maximum number of tokens that the model can generate. Higher values allow the model to predict longer chat messages, but also increase the time it takes to generate predictions. The default value of 16 is fine for most use cases.” |
Default value | 16 tokens |
Minimum | 1 token |
Maximum | 100 tokens |
Increment | 1 token |
Temperature
Temperature | |
---|---|
Type | Slider |
In-game description | “Controls the model's creativity and randomness. A higher value will result in more creative and sometimes nonsensical completions, while a lower value will result in more boring completions.” |
Default value | 1 |
Minimum | 0 |
Maximum | 2 |
Increment | 0.01 |
Note: Temperature values above 1 will cause most language models to generate complete nonsense and should only be used for comedic effect.
Top P
Top P | |
---|---|
Type | Slider |
In-game description | “An alternative to temperature. Makes the model less random by only letting it choose from the most likely tokens. A value of 100% disables this feature by letting the model choose from all tokens.” |
Default value | 100% |
Minimum | 0% |
Maximum | 100% |
Increment | 1% |
Presence penalty
Presence penalty | |
---|---|
Type | Slider |
In-game description | “Penalty for choosing tokens that already appear in the chat history. Positive values encourage the model to use synonyms and talk about different topics. Negative values encourage the model to repeat the same word over and over again.” |
Default value | 0 |
Minimum | -2 |
Maximum | 2 |
Increment | 0.01 |
Frequency penalty
Frequency penalty | |
---|---|
Type | Slider |
In-game description | “Similar to presence penalty, but based on how often the token appears in the chat history. Positive values encourage the model to use synonyms and talk about different topics. Negative values encourage the model to repeat existing chat messages.” |
Default value | 0 |
Minimum | -2 |
Maximum | 2 |
Increment | 0.01 |
Stop sequence
Stop sequence | |
---|---|
Type | Enum |
In-game description | “Controls how AutoComplete detects the end of a chat message. Line Break is the default value and is recommended for most language models. Next Message works better with certain code-optimized language models, which have a tendency to insert line breaks in the middle of a chat message.” |
Default value | Line Break |
Possible values | Line Break, Next Message |
Note: “certain code-optimized language models” is a reference to OpenAI's code-davinci-002
model, which worked much better when using the “Next Message” option and is unfortunately no longer available. It's possible that open source code models like StarCoder will see a similar improvement when using the “Next Message” option.
Context length
Context length | |
---|---|
Type | Slider |
In-game description | “Controls how many messages from the chat history are used to generate predictions. Higher values improve the quality of predictions, but also increase the time it takes to generate them, as well as cost (for APIs like OpenAI) or RAM usage (for self-hosted models).” |
Default value | 10 messages |
Minimum | 0 (unlimited) |
Maximum | 100 messages |
Increment | 1 message |
Filter server messages
Filter server messages | |
---|---|
Type | Checkbox |
In-game description | “Only shows player-made chat messages to the model. This can help you save tokens and get more out of a low context length, but it also means that the model will have no idea about events like players joining, leaving, dying, etc.” |
Default value | not checked |
Custom model
Custom model | |
---|---|
Type | TextField |
In-game description | “If set, this model will be used instead of the one specified in the \”OpenAI model\“ setting. Use this if you have a fine-tuned OpenAI model or if you are using a custom endpoint that is OpenAI-compatible but offers different models.” |
Default value | (empty) |
Custom model type
Custom model type | |
---|---|
Type | Enum |
In-game description | “Whether the custom model should use the chat endpoint or the legacy endpoint. If \”Custom model\“ is left blank, this setting is ignored.” |
Default value | Chat |
Possible values | Chat, Legacy |
OpenAI chat endpoint
OpenAI chat endpoint | |
---|---|
Type | TextField |
In-game description | “Endpoint for OpenAI's chat completion API.” |
Default value | https://api.openai.com/v1/chat/completions |
The “OpenAI chat endpoint” setting allows the user to use OpenAI's chat completion API through a proxy. This is necessary in some countries where OpenAI's APIs are banned.
It may also be useful for Microsoft Azure customers who have their own endpoint, but this has not been tested yet. There are subtle differences in the Azure version of the API, so it's possible that it won't work with AutoComplete.
OpenAI legacy endpoint
OpenAI legacy endpoint | |
---|---|
Type | TextField |
In-game description | “Endpoint for OpenAI's legacy completion API.” |
Default value | https://api.openai.com/v1/completions |
Max suggestions per draft
Max suggestions per draft | |
---|---|
Type | Slider |
In-game description | “How many suggestions the AI is allowed to generate for the same draft message.” |
Default value | 3 |
Minimum | 1 |
Maximum | 10 |
Increment | 1 |
The “Max suggestions per draft” setting controls how many different suggestions the AI will try to generate for the same draft message. Higher values will result in more suggestions, but will also use up more tokens and be more expensive for OpenAI API users. This setting can be useful for exploring different response options.
Setting “Max suggestions per draft” to a higher value than “Max suggestions shown” is usually not a good idea, as there will be no way to see the additional suggestions.
Max suggestions kept
Max suggestions kept | |
---|---|
Type | Slider |
In-game description | “Maximum number of suggestions kept in memory.” |
Default value | 100 messages |
Minimum | 10 messages |
Maximum | 1000 messages |
Increment | 10 messages |
The “Max suggestions kept” setting only controls at what point old suggestions are deleted from memory. Higher values don't use any additional tokens and only consume a tiny amount of RAM. This is why the range of values is so much higher than for the other settings.
Max suggestions shown
Max suggestions shown | |
---|---|
Type | Slider |
In-game description | “How many suggestions can be shown above the chat box. If this is set too high, the suggestions will obscure some of the existing chat messages. How high you can set this depends on your screen resolution and GUI scale.” |
Default value | 5 |
Minimum | 1 |
Maximum | 10 |
Increment | 1 |
The “Max suggestions shown” setting controls how many suggestions can be shown at once on the screen. Depending on the user's screen resolution and GUI scale, higher values may cause the suggestions to cover up other parts of the UI.
Setting “Max suggestions per draft” to a higher value than “Max suggestions shown” is usually not a good idea, as there will be no way to see the additional suggestions.
Changes
-
- Added AutoComplete.
-
- Fixed the description of AutoComplete's "Max tokens" setting.
-
- Added "OpenAI chat endpoint", "OpenAI legacy endpoint" and "Oobabooga endpoint" settings to AutoComplete.
- AutoComplete now supports all of OpenAI's currently available language models.
-
- Added support for gpt-3.5-turbo-1106 and gpt-4-1106-preview models to AutoComplete.
-
- Updated the list of OpenAI models in AutoComplete.
- Added "Custom model" and "Custom model type" settings to AutoComplete.
- AutoComplete's "Max suggestions per draft" setting now generates all the suggestions at once.
- AutoComplete's "Temperature" setting now defaults to 1, and "Frequency penalty" defaults to 0.
- Removed the "API provider", "Repetition penalty", and "Encoder repetition penalty" settings from AutoComplete.
-
- Updated AutoComplete's list of OpenAI models.