Skip to content

Conversation

@pawelf-cerebras
Copy link
Contributor

Added CePO as an optional method:

  • Added CePO implementation
  • Updated README to document parameters
  • Added Cerebras API as optional inference provider

@CLAassistant
Copy link

CLAassistant commented Jan 22, 2025

CLA assistant check
All committers have signed the CLA.

@codelion
Copy link
Member

codelion commented Jan 22, 2025

Hi, thanks for contributing your changes upstream.

Can we organize the code in its own cepo folder under /optillm/cepo/

You can put a detailed README.md in that folder as well along with the configs/

We can keep the main results table under the https://github.com/codelion/optillm?tab=readme-ov-file#sota-results-on-benchmarks-with-optillm and move the rest of the details into its own README.

pawelf-cerebras and others added 28 commits January 23, 2025 15:06
This will allow users to do the following:
1. Cli where if they can pass in anything that's "cepo_<name-of-attribute>"
2. Yaml file where if they pass it in as "<name-of-attribute>"
3. If none of them have a specific attribute, we use the default setting
4. If both of them have the specific attribute, we error out
Added a license header
Added a link to view the file changes
Added a license header
Added a license header
@pawelf-cerebras
Copy link
Contributor Author

Thank you, @codelion! Certainly! I moved the code, config, and the CePO part of README into optillm/cepo, I only left the results for CePO vs base models in the SOTA results section.

@codelion
Copy link
Member

Looks, good. I will merge the PR. Thank you for your contributions. I will also cut a new release of the PYPI package with these changes later today.

@codelion codelion merged commit 63b25fb into algorithmicsuperintelligence:main Jan 24, 2025
1 check passed
@pawelf-cerebras pawelf-cerebras deleted the cepo_rc branch September 18, 2025 18:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants