Fusion of Abstract Learning and Context-Optimized Neural-methods (FALCON) - SBIR Topic DPA26BZ04-DV016
Funding Amount:
$1,500,000
Deadline to Apply:
August 19th, 2026
Objective:
The goal of this effort is to combine advanced machine learning (ML) methods that can be computationally efficient in structured data with large language models (LLM) that are general and can extract context from data. The aim is to derive powerful and efficient technology for interactive statistical analysis of large-scale data seen such as in enterprise or battlefield.
Description:
By integrating the contextualization power of LLMs with the statistical power of ML, the program aims to leverage the benefits of both to provide domain-specific statistical contextualization of structured data.
This effort must:
Survey and research the emerging ML methods suitable for large scale data containing both structured and unstructured data.
Develop an architecture to combine select ML methods with LLM models.
Determine a set of metrics that encompass accuracy, new insights, computational efficiency, and ability to generalize across datasets.
Evaluate the combined architecture and methods in datasets drawn from multiple applications. This may focus on tabular data in enterprise or engineering applications.
Develop and demonstrate methods to mitigate possible hallucination in the workflow and demonstrate verifiable and reproducible analytic traces.
Demonstrate interactive analysis by incorporating new insights as they develop during the course of analysis.
PHASE I:
This topic is soliciting Direct to Phase II (DP2) proposals only.
Phase I feasibility should be demonstrated by documenting in the technical proposal the research team’s prior comprehensive research experience on emerging ML methods for structured data.
Proposals must demonstrate the research team understands their functionality, ability to scale, and advantages relative to SOTA ML methods.
Prior research should have been conducted in the last three years.
Reports which provide data, clearly present the analysis done, and provide evidence of scholarly impact will be strongly preferred.
PHASE II:
Identify one or more promising ML methods and develop the software for combining it with one or more available LLMs, preferably open source.
The implementation plan should incorporate an initial demonstration of the analysis functionality (in the first six months), and a plan to scale up to enterprise level data and interactive analysis (at the end of the first year).
The final demonstrations are to be done with at least two datasets from different areas.
The evaluation of the combined method should demonstrate accuracy relative to ground truth as well document the improvement over SOTA methods (ML only, and LLM only).
This Phase should have a parallel effort on the commercialization strategy implementation and a go-to-market plan.
Phase II fixed payable milestones for this program should include:
Base
Month 1: Kick off and technical report on the ML methods to be implemented.
Month 3: Quarterly meeting presentation material, including demonstration of progress to date and plans.
Month 6: Quarterly meeting presentation material, including demonstration of progress to date, plans overall, and an initial demonstration of the analysis functionality.
Month 11: Phase deliverable to demonstrate ability to scale up to enterprise level data and interactive analysis with at least two datasets from different areas.
Month 12: Quarterly/Phase meeting presentation material on progress over base year. Phase II Final report and software delivery, with suitable documentation.
Option
Month 15: Quarterly meeting presentation material, including demonstration of technical and commercialization progress to date and plans.
Month 18: Quarterly/Phase meeting presentation material on progress over Option. Option Final report and software delivery, with suitable documentation.
PHASE III DUAL USE APPLICATIONS:
The effort should deliver technology that is quantitatively superior to SOTA methods in both civilian and military sectors.
Modern commercial and scientific applications from business enterprises to biology labs which have structured data along with unstructured text are potential applications.
Military strategic and tactical applications also have to deal with structured tabular data in design and manufacturing as well as in analyzing economic, social, and geographic data.
Who will win?
If you can achieve the objective above better than any other company on the market, you have a very high-likelihood of success and should apply.
Who is eligible to apply?
Any company that meets the following criteria:
For-profit company
U.S.-owned and controlled.
500 or fewer employees (including affiliates)
How Can BW&CO Help?
1) End-to-end support including, strategy, writing of the full proposal, and administrative & compliance support.
2) Proposal strategy and review.
3) Administrative & compliance support.
Request to talk with a member of our team by completing the form below: