Merged
51 commits
46cf81c
add openai api support
future-xy Feb 29, 2024
87c3e28
add test scripts, update readme, update api
future-xy Mar 10, 2024
ee45093
Fix: Undefined Symbol Compilation Error (#37)
drunkcoding Feb 15, 2025
cca059c
Refactor code for better performance (#38)
drunkcoding Feb 23, 2025
087c68e
CI: add pre commit format ci (#40)
lausannel Feb 26, 2025
c57995e
Chore: rename organization name & optimize CI (#41)
drunkcoding Mar 5, 2025
4a44e6e
CI: fix not a git repository in CI (#43)
lausannel Mar 5, 2025
244b2a4
CI: fix missing sudo in apt install (#44)
lausannel Mar 5, 2025
938c38b
CI: fix missing sudo (#45)
lausannel Mar 5, 2025
ce036fc
CI: revert os matrix in CI (#46)
lausannel Mar 5, 2025
f79839c
CI: add missing apt update after installing deb file (#47)
lausannel Mar 5, 2025
c762363
Doc: Update README example to DeepSeek and Suppress Warning (#49)
drunkcoding Mar 16, 2025
f4505e5
CI: do not build test if document update (#52)
drunkcoding Mar 17, 2025
ba9d66f
Merge branch 'main' into feature/openai_api
Mar 22, 2025
9045494
format and change to deepseek in example
Mar 22, 2025
72c641e
fix format
Mar 22, 2025
c218025
remove unused files
Mar 22, 2025
7b97703
fix api server token id device
Mar 22, 2025
ac92327
feat: Introduce Local Server for OpenAI-Compatible APIs (#4)
future-xy Mar 22, 2025
9906513
fix gen broken
Mar 23, 2025
5c87fe9
update readme links
Mar 23, 2025
9257e81
cancel concurrent job
Mar 23, 2025
18d08aa
set dense node to device
Mar 23, 2025
cc25124
sparse node set cpu
Mar 23, 2025
9d0b4d8
Merge branch 'main' into feature/openai_api
drunkcoding Mar 23, 2025
de0ebf5
remove OS def
Mar 23, 2025
ba35284
Merge branch 'feature/openai_api' of github.com:TorchMoE/MoE-Infinity…
Mar 23, 2025
128c30f
use update to date clang-format
Mar 23, 2025
e5f625f
fix setuptools version
Mar 23, 2025
48324d8
fix setuptools version for python 3.8
Mar 23, 2025
f73e5b0
keep single cuda version in publish
Mar 23, 2025
cd35a21
feat: set parameter to device before serving (#56)
drunkcoding Mar 23, 2025
fe81a87
add max length in gen openai
Mar 27, 2025
845e89d
fix cache race condition
Apr 16, 2025
ef028d8
all param init at host
Apr 21, 2025
eb0bb11
add qwen3
May 1, 2025
50c9b65
Merge branch 'feature/openai_api' into feature/qwen
May 2, 2025
5c7e368
ubuntu lts and build
May 10, 2025
cde7d3b
pre-commit ubuntu version
May 10, 2025
ea2f3b3
router weights update overlap
May 11, 2025
5017bcc
rename deepseek_v2 and reduce torch kernel launch
May 11, 2025
042b2ee
fix import
May 11, 2025
8d190e9
fix build and fix bug
May 12, 2025
d902eca
fix citation linebreak
May 18, 2025
1a5e10f
fix typo
Jun 14, 2025
7916de6
fix dtype size
Jun 14, 2025
93bf9ad
remove comments
Jun 14, 2025
33932d0
fix example
Jun 14, 2025
823d393
pr update init
Jun 15, 2025
afd0bd1
remove comment and unify deepseek preroute
Jun 16, 2025
d4e80c3
feat: Merge kernels from vLLM and FlashInfer (#63)
drunkcoding Feb 16, 2026
158 changes: 15 additions & 143 deletions .clang-format
@@ -1,71 +1,21 @@
---
# Refer to the following link for the explanation of each params:
# http://releases.llvm.org/8.0.0/tools/clang/docs/ClangFormatStyleOptions.html
Language: Cpp

BasedOnStyle: Google
AccessModifierOffset: -4
AlignAfterOpenBracket: Align
UseTab: Never
IndentWidth: 2
ColumnLimit: 80

# Force pointers to the type for C++.
DerivePointerAlignment: false
PointerAlignment: Left

# Reordering #include statements can (and currently will) introduce errors
SortIncludes: false

# Style choices
AlignConsecutiveAssignments: false
AlignConsecutiveDeclarations: false
AlignEscapedNewlines: Left
AlignOperands: true
AlignTrailingComments: true
AllowAllParametersOfDeclarationOnNextLine: false
AllowShortBlocksOnASingleLine: true
AllowShortCaseLabelsOnASingleLine: true
AllowShortFunctionsOnASingleLine: All
AllowShortIfStatementsOnASingleLine: true
AllowShortLoopsOnASingleLine: true
# This is deprecated
AlwaysBreakAfterDefinitionReturnType: None
AlwaysBreakAfterReturnType: None
AlwaysBreakBeforeMultilineStrings: true
AlwaysBreakTemplateDeclarations: true
BinPackArguments: false
BinPackParameters: false
BraceWrapping:
  AfterClass: false
  AfterControlStatement: false
  AfterEnum: false
  AfterFunction: false
  AfterNamespace: false
  AfterObjCDeclaration: false
  AfterStruct: false
  AfterUnion: false
  AfterExternBlock: false
  BeforeCatch: false
  BeforeElse: false
  IndentBraces: false
  # disabling the below splits, else, they'll just add to the vertical length of source files!
  SplitEmptyFunction: false
  SplitEmptyRecord: false
  SplitEmptyNamespace: false
BreakBeforeBinaryOperators: None
BreakBeforeBraces: WebKit
BreakBeforeInheritanceComma: false
BreakInheritanceList: BeforeColon
BreakBeforeTernaryOperators: true
BreakConstructorInitializersBeforeComma: false
BreakConstructorInitializers: BeforeColon
BreakAfterJavaFieldAnnotations: false
BreakStringLiterals: true
ColumnLimit: 100
CommentPragmas: '^ IWYU pragma:'
CompactNamespaces: false
ConstructorInitializerAllOnOneLineOrOnePerLine: true
# Kept the below 2 to be the same as `IndentWidth` to keep everything uniform
ConstructorInitializerIndentWidth: 2
ContinuationIndentWidth: 2
Cpp11BracedListStyle: true
DerivePointerAlignment: false
DisableFormat: false
ExperimentalAutoDetectBinPacking: false
FixNamespaceComments: true
ForEachMacros:
  - foreach
  - Q_FOREACH
  - BOOST_FOREACH
IncludeBlocks: Preserve
IndentPPDirectives: BeforeHash

IncludeCategories:
  - Regex: '^<ext/.*\.h>'
    Priority: 2
@@ -75,81 +25,3 @@ IncludeCategories:
    Priority: 2
  - Regex: '.*'
    Priority: 3
IncludeIsMainRegex: '([-_](test|unittest))?$'
IndentCaseLabels: true
IndentPPDirectives: None
IndentWidth: 2
IndentWrappedFunctionNames: false
JavaScriptQuotes: Leave
JavaScriptWrapImports: true
KeepEmptyLinesAtTheStartOfBlocks: false
MacroBlockBegin: ''
MacroBlockEnd: ''
MaxEmptyLinesToKeep: 1
NamespaceIndentation: None
ObjCBinPackProtocolList: Never
ObjCBlockIndentWidth: 4
ObjCSpaceAfterProperty: false
ObjCSpaceBeforeProtocolList: true
PenaltyBreakAssignment: 4
PenaltyBreakBeforeFirstCallParameter: 1
PenaltyBreakComment: 300
PenaltyBreakFirstLessLess: 120
PenaltyBreakString: 1000
PenaltyBreakTemplateDeclaration: 10
PenaltyExcessCharacter: 1000000
PenaltyReturnTypeOnItsOwnLine: 200
PointerAlignment: Left
RawStringFormats:
  - Language: Cpp
    Delimiters:
      - cc
      - CC
      - cpp
      - Cpp
      - CPP
      - 'c++'
      - 'C++'
    CanonicalDelimiter: ''
  - Language: TextProto
    Delimiters:
      - pb
      - PB
      - proto
      - PROTO
    EnclosingFunctions:
      - EqualsProto
      - EquivToProto
      - PARSE_PARTIAL_TEXT_PROTO
      - PARSE_TEST_PROTO
      - PARSE_TEXT_PROTO
      - ParseTextOrDie
      - ParseTextProtoOrDie
    CanonicalDelimiter: ''
    BasedOnStyle: google
# Enabling comment reflow causes doxygen comments to be messed up in their formats!
ReflowComments: true
SortIncludes: true
SortUsingDeclarations: true
SpaceAfterCStyleCast: false
SpaceAfterTemplateKeyword: true
SpaceBeforeAssignmentOperators: true
SpaceBeforeCpp11BracedList: false
SpaceBeforeCtorInitializerColon: true
SpaceBeforeInheritanceColon: true
SpaceBeforeParens: ControlStatements
SpaceBeforeRangeBasedForLoopColon: true
SpaceInEmptyParentheses: false
SpacesBeforeTrailingComments: 2
SpacesInAngles: false
SpacesInContainerLiterals: true
SpacesInCStyleCastParentheses: false
SpacesInParentheses: false
SpacesInSquareBrackets: false
Standard: Cpp11
StatementMacros:
  - Q_UNUSED
  - QT_REQUIRE_VERSION
# Be consistent with indent-width, even for people who use tab for indentation!
TabWidth: 2
UseTab: Never
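
As a rough illustration of several options that appear in the `.clang-format` diff above (`IndentWidth: 2`, `UseTab: Never`, `ColumnLimit: 80`, `PointerAlignment: Left`, `AlignAfterOpenBracket: Align`, `SortIncludes: false`, `IndentPPDirectives: BeforeHash`), here is a small hand-formatted C++ sketch. It assumes the short option block beginning with `BasedOnStyle: Google` is the configuration that remains after this change, which the flattened diff does not show explicitly. The sample is hypothetical: `CountSpaces`, `JoinWithComma`, and `SAMPLE_HAS_CXX11` are made-up names, not MoE-Infinity code, and the snippet has not been run through clang-format.

```cpp
// Hypothetical sample, formatted by hand to approximate the options above.
#include <cassert>
#include <string>
#include <cstddef>  // SortIncludes: false keeps includes in written order.

// IndentPPDirectives: BeforeHash indents nested directives before the '#'.
#if defined(__cplusplus)
  #if __cplusplus >= 201103L
    #define SAMPLE_HAS_CXX11 1
  #endif
#endif

// PointerAlignment: Left attaches '*' and '&' to the type, not the name.
std::size_t CountSpaces(const char* text) {
  assert(text != nullptr);
  std::size_t count = 0;  // IndentWidth: 2 with spaces only (UseTab: Never).
  for (const char* p = text; *p != '\0'; ++p) {
    if (*p == ' ') ++count;
  }
  return count;
}

// The full signature would exceed ColumnLimit: 80, so it wraps, and
// AlignAfterOpenBracket: Align lines the second parameter up under the first.
std::string JoinWithComma(const std::string& left_operand,
                          const std::string& right_operand) {
  return left_operand + ", " + right_operand;
}
```

Running `clang-format --dry-run` on a file like this with the repository's configuration is one way to check whether the hand formatting actually matches what the tool would produce.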
75 changes: 75 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.yml
@@ -0,0 +1,75 @@
name: Bug Report
description: File an issue about a bug in MoE-Infinity.
title: "[BUG] "
labels: [bug]
assignees: []
body:
  - type: markdown
    attributes:
      value: |
        Please provide as much detail as possible to help us address the issue efficiently. If you're unsure if this is a bug, consider asking by creating an issue.

  - type: checkboxes
    id: prerequisites
    attributes:
      label: Prerequisites
      options:
        - label: I have read the [MoE-Infinity documentation]().
          required: true
        - label: I have searched the [Issue Tracker](https://github.com/EfficientMoE/MoE-Infinity/issues) to ensure this hasn't been reported before.
          required: true

  - type: textarea
    id: system-info
    attributes:
      label: System Information
      description: Please provide details about your environment (OS, Python version, GPU, etc.).
    validations:
      required: true

  - type: textarea
    id: description
    attributes:
      label: Problem Description
      description: Provide a clear description of the bug.
    validations:
      required: true

  - type: textarea
    id: reproduction
    attributes:
      label: Steps to Reproduce
      description: Please provide code snippets and steps to reproduce the issue.
      value: |
        Code snippets:
        ```python

        ```

        Steps to reproduce:
        1.
        2.
        3.
    validations:
      required: true

  - type: textarea
    id: expected
    attributes:
      label: Expected Behavior
      description: What did you expect to happen?

  - type: textarea
    id: additional-context
    attributes:
      label: Additional Context
      description: Add any other relevant information, screenshots, or suggested fixes.

  - type: textarea
    id: statistics
    attributes:
      label: Usage Statistics (Optional)
      description: |
        We'd love to know how you're using MoE-Infinity! If you're comfortable, please share details like your affiliation and use case to help us improve the project.
    validations:
      required: false
62 changes: 62 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.yml
@@ -0,0 +1,62 @@
name: Feature Request
description: Suggest an idea for MoE-Infinity
title: "[Feature Request] "
labels: [enhancement]
assignees: [futurexy]
body:
  - type: checkboxes
    id: prerequisites
    attributes:
      label: Prerequisites
      options:
        - label: I have searched existing issues and reviewed documentation.
          required: true

  - type: textarea
    id: problem
    attributes:
      label: Problem Description
      description: Is your feature request related to a problem? Please describe.
      placeholder: I'm always frustrated when [...]
    validations:
      required: true

  - type: textarea
    id: solution
    attributes:
      label: Proposed Solution
      description: Describe the solution you'd like.
    validations:
      required: true

  - type: textarea
    id: alternatives
    attributes:
      label: Alternatives Considered
      description: Describe any alternative solutions or features you've considered.

  - type: textarea
    id: context
    attributes:
      label: Additional Context
      description: Add any other context, examples, or screenshots about the feature request here.

  - type: dropdown
    id: importance
    attributes:
      label: Importance
      options:
        - Nice to have
        - Important
        - Critical
    validations:
      required: true

  - type: textarea
    id: statistics
    attributes:
      label: Usage Statistics (Optional)
      description: |
        We'd love to know how you're using MoE-Infinity! If you're comfortable, please share details like your affiliation and use case to help us improve the project.
    validations:
      required: false
17 changes: 17 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
@@ -0,0 +1,17 @@
## Description
Briefly describe your changes.

## Motivation
Explain why this change is needed and what problem it solves.
If it fixes an issue, link it (e.g., `close #123`).

## Type of Change
- [ ] Bug fix
- [ ] New feature
- [ ] Breaking change
- [ ] Documentation update

## Checklist
- [ ] I have read the [CONTRIBUTION](https://github.com/EfficientMoE/MoE-Infinity/blob/main/CONTRIBUTING.md) guide.
- [ ] I have updated the tests (if applicable).
- [ ] I have updated the documentation (if applicable).