Add InferX provider (OpenAI-compatible endpoints) #1843
Open
Prashanth-InferX wants to merge 19 commits into anomalyco:dev from Prashanth-InferX:patch-1
19 commits
fcb9999 Create provider.toml (Prashanth-InferX)
50eeac2 Create qwen3-coder-next-fp8.toml (Prashanth-InferX)
de3f51e Create qwen3-coder-next-fp8-1m.toml (Prashanth-InferX)
2ee373f Update qwen3-coder-next-fp8-1m.toml (Prashanth-InferX)
353af73 Create qwen3.5-122b-a10b-nvfp4.toml (Prashanth-InferX)
bd985e6 Create qwen3.6-27b-fp8.toml (Prashanth-InferX)
f5b2ae2 Create qwen3.6-35b-a3b-fp8.toml (Prashanth-InferX)
cdfee03 Create gemma-4-31b-it-fp8.toml (Prashanth-InferX)
16b7bff Create gemma-4-31b-it-fp8.toml (Prashanth-InferX)
3163eae Delete providers/inferx/models/qwen/gemma-4-31b-it-fp8.toml (Prashanth-InferX)
6d14378 Update gemma-4-31b-it-fp8.toml (Prashanth-InferX)
ba9e5f5 Update qwen3-coder-next-fp8-1m.toml (Prashanth-InferX)
8f9ad03 Update qwen3-coder-next-fp8.toml (Prashanth-InferX)
39163ab Update qwen3-coder-next-fp8-1m.toml (Prashanth-InferX)
7d52d15 Update qwen3-coder-next-fp8.toml (Prashanth-InferX)
0057a21 Add InferX logo (Prashanth-InferX)
e239a17 Update provider.toml (Prashanth-InferX)
b232252 Update logo.svg (Prashanth-InferX)
8f449a6 Update provider.toml (Prashanth-InferX)
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Gemma 4 31B IT FP8" | ||
| family = "gemma" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-05" | ||
| attachment = false | ||
| reasoning = false | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 262144 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Qwen3 Coder Next FP8 1M" | ||
| family = "qwen" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-12" | ||
| attachment = false | ||
| reasoning = true | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 1024000 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Qwen3 Coder Next FP8" | ||
| family = "qwen" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-08" | ||
| attachment = false | ||
| reasoning = true | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 256144 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Qwen3.5 122B A10B NVFP4" | ||
| family = "qwen3.5" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-13" | ||
| attachment = false | ||
| reasoning = true | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 256144 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Qwen3.6 27B FP8" | ||
| family = "qwen3.6" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-13" | ||
| attachment = false | ||
| reasoning = true | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 262144 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| name = "Qwen3.6 35B A3B FP8" | ||
| family = "qwen3.6" | ||
| release_date = "2026-05" | ||
| last_updated = "2026-05-13" | ||
| attachment = false | ||
| reasoning = true | ||
| temperature = true | ||
| tool_call = true | ||
| open_weights = true | ||
| | ||
| [cost] | ||
| input = 0.0 | ||
| output = 0.0 | ||
| | ||
| [limit] | ||
| context = 262000 | ||
| output = 8192 | ||
| | ||
| [modalities] | ||
| input = ["text"] | ||
| output = ["text"] |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,5 @@ | ||
| name = "InferX" | ||
| env = ["INFERX_API_KEY"] | ||
| npm = "@ai-sdk/openai-compatible" | ||
| api = "https://model.inferx.net/v1" | ||
| doc = "https://inferx.net" | ||
no logo?
Added logo.svg per spec and updated provider.toml accordingly. Thank you so much.
Thanks again. I addressed the logo feedback and fixed the validation issue as well. Now waiting for workflow approval to rerun checks when you have a moment.