Skip to content

GSoC 2023 brainstorming thread #2354

@terriko

Description

@terriko

This thread is intended for folk to brainstorm about potential project ideas for GSoC 2023. The idea is to have a separate thread from the "GSoC start here" issue (#2230) so that we have a place where we can talk about ideas that may be completely not viable. Once we get some ideas that are viable and well-described enough that a GSoC contributor could take it and run with it, we'll add them to #2230.

Some thoughts off the top of my head:

  1. Improved VEX triage tools
    • Some user tools (and usage docs) for helping people do triage on cve-bin-tool scans and produce VEX + SBOM output. Maybe even GUI based tooling for triage, hooks for an existing tool (like VSCode) that might allow for easier JSON editing already, or an ability to save from a local HTML report?
  2. Improved product representation & meta-info about products.
    • We currently just report whatever we called a thing internally in the binary scans, and whatever it was called in the file in non-binary scans such as SBOM or package list parsing, but it would be nice to include things like software heritage designations, especially to allow for de-duplication if we combine scans from multiple BOMs. We might also want to see how viable it is to provide other commonly desired meta-info like licensing, source urls, packaging data, etc.
    • You will almost certainly need to build a data format for de-dupe / meta data and allow users to be able to add to it, similar to how we have checkers right now.
    • Note: I expect this one to be hard and require a lot of heuristic work. 350hr minimum probably.
    • The goal here isn't perfection, but if we could say tag 50-70% of things with additional meta-data that might be enough.
  • Scanning the internet with cve-bin-tool.
    • Work on setting up github actions workflows and tooling to support scanning of hundreds of repos and handling problems at scale like shared triage, pretty overview graphs, monitoring, whatever else might be needed. (Imagine if you were, say, the Python Software Foundation wanting to scan every repo associated with a pypi project, or a corporation trying to scan your own public projects. what additional tools would you need to manage that much data?)

Anyhow, even "bad" ideas are worth discussing at brainstorming stage, so don't be shy even if something might not wind up viable for gsoc. Brainstorm away!

Metadata

Metadata

Assignees

No one assigned

    Labels

    gsocTasks related to our participation in Google Summer of Code

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions