FoodNexus: Massive Food Knowledge for Recommender Systems

48th European Conference on Information Retrieval (ECIR) 2026
FoodNexus: Massive Food Knowledge for Recommender Systems - Overview

Abstract

Personalized food recommendation can promote healthier, sustainable eating, but current systems often rely on sparse and unstructured data, limiting semantic expressiveness and diverse personalization. In this paper, we propose FoodNexus, a large-scale knowledge graph with nearly one billion triples designed to enrich food recommendation with structured, nutrition-aware, and user-contextual information. We built it via a multi-stage pipeline that combines and augments the largest public dataset of user–recipe interactions, HUMMUS, with extensive metadata from Open Food Facts by linking recipes to concrete food products, extracting user traits from their biographies and reviews, and mapping both data sources onto the same ontology. Experiments show that FoodNexus enables richer, nutrition-sensitive evaluation of recommendations.

Motivation

Personalized food recommendation systems can play a crucial role in promoting healthier and more sustainable eating habits. However, current systems often rely on sparse and unstructured data, which limits their semantic expressiveness and ability to support diverse personalization needs.

Existing datasets for food recommendation face two key limitations:

  • Lack of user-level constructs: Most datasets do not capture user preferences, dietary restrictions, or consumption traits
  • Missing product granularity: Recipes cannot be aligned with real food products for nutritional and sustainability analysis

The Problem: No existing resource comprehensively supports nutrition, sustainability, and user constraints in food recommendation. Current systems cannot evaluate whether recommendations align with users' health goals or environmental concerns.

FoodNexus Knowledge Graph

FoodNexus is a large-scale knowledge graph designed to enrich food recommendation with structured, nutrition-aware, and user-contextual information. It combines and augments the largest public dataset of user-recipe interactions (HUMMUS) with extensive metadata from Open Food Facts.

FoodNexus ontology overview
Figure 1: Visual overview of the FoodNexus ontology showing entities and their relationships.

Scale and Coverage

  • ~979.5 million triples capturing food knowledge
  • ~51 million entities across 11 entity types
  • ~130.2 million attributes with 15 attribute types
  • 11 relation types connecting entities
  • 73% of entities and 70% of relations aligned with schema.org vocabulary for interoperability
Comparison of data sources
Table 1: Comparison of feature-rich data sources for food recommendation.

Construction Pipeline

FoodNexus is built via a 5-stage pipeline that combines multiple data sources and enriches them with semantic knowledge:

Stage 1: Source Dataset Selection

We combine HUMMUS, the largest public dataset of user-recipe interactions, with Open Food Facts, a comprehensive database of food products with nutritional and sustainability information.

Stage 2: Combined Ontology Specification

We design a unified ontology that captures recipes, ingredients, products, users, and their relationships, aligned with schema.org vocabulary for interoperability.

Stage 3: Recipe-Product Entity Linking

Using the BAAI/bge-en-icl encoder with a 0.85 similarity threshold, we link recipe ingredients to concrete food products from Open Food Facts.

Stage 4: User Consumption Traits Extraction

We employ Qwen2.5 LLM to extract user traits (dietary preferences, restrictions, health goals) from their biographies and reviews.

Stage 5: Knowledge Graph Assembly

All extracted entities, relations, and attributes are assembled into a unified knowledge graph following the specified ontology.

Table 2: Relations with element names in FoodNexus.

SourceRelationTargetDescription
UserGrouppublishesRecipeRecipeA user has published a recipe
UserGrouppublishesReviewUserReviewA user has published a review
UserGrouphasConstraintTagA user has a dietary constraint
UserReviewitemReviewedRecipeA review is associated with a recipe
RecipehasIndicatorIndicatorA recipe has a certain indicator
ProducthasIndicatorIndicatorA product has a certain indicator
RecipehasPartIngredientA recipe includes a specific ingredient
IngredientisRelatedToRecipeAn ingredient is part of a recipe
TagsuitableForDietRecipeA constraint is compatible with a recipe
TagsuitableForDietProductA constraint is compatible with a product
ProductsameAsRecipeA product is similar/identical to a recipe
FoodProducerproducesProductA producer produces a product
RecipesameAsRecipeA recipe is an alternative or similar recipe
StoreoffersProductA store sells a certain product
StoreisPlaceInCityA store is located in a certain city
ProductcountryOfAssemblyCountryA product is assembled in a country

Statistics and Comparison

Table 3: Comparative statistics between original data sources and FoodNexus.

Data Source# Triples# Entities# Attributes# E. Types# R. Types# A. Types
HUMMUS~53.9M~12.3M~22.8M669
HUMMUS (inferred)~57.9M~12.3M~22.8M6714
OFF~267.9M~38.5M~107.4M767
FoodNexus (Ours)~979.5M~51.0M~130.2M111115

E. Types: Entity Types; R. Types: Relation Types; A. Types: Attribute Types

Experimental Results

We evaluate FoodNexus through two research questions:

RQ1: Recommendation Utility

We compare several recommender systems on FoodNexus to assess recommendation quality:

Table 4: Performance comparison across recommendation models with Hit, Recall, and NDCG on top-10/20/50 lists.

ModelHit@10Hit@20Hit@50Recall@10Recall@20Recall@50NDCG@10NDCG@20NDCG@50
Pop0.00350.00500.00770.00080.00100.00130.00080.00090.0009
BPR0.04740.07770.13610.01360.02440.04670.01020.01330.0189
NeuMF0.05620.08860.14910.01750.02760.05150.01310.01600.0220
LightGCN0.06040.09300.15580.01840.03070.05570.01400.01730.0235
MKR0.04130.06560.11260.01660.02720.04770.01110.01400.0188
KTUP0.04460.07370.12600.01890.03150.05520.01180.01540.0210
KGAT0.05800.09030.14530.02310.03720.06280.01520.01910.0252
UserKGAT0.01870.03680.08020.00840.01640.03700.00460.00690.0116
UserKTUP0.03880.06900.12600.01540.02950.05700.00930.01330.0198
UserMKR0.04610.07440.12640.01870.03190.05560.01200.01570.0212
  • LightGCN achieves the highest Hit scores
  • KGAT achieves the best Recall and NDCG
  • UserMKR (user-aware variant) can capture useful KG patterns when properly designed

RQ2: Nutritional and Contextual Analysis

FoodNexus enables nutrition-sensitive evaluation, revealing systematic biases in current recommender systems:

Categorical analysis heatmap
Figure 2: Average per-user difference Δ between category proportions in models' top-k lists and global category prevalence for Allergens, Packaging, and Green Score.
Nutritional analysis
Figure 3: Average per-user difference Δ for numerical nutritional attributes including Added Sugars, Saturated Fat, Potassium, Fiber, and Vitamins.
  • Gluten bias: Models tend to over-represent gluten-containing items
  • Packaging bias: Interaction-based systems over-represent plastic-packaged products
  • Sustainability skew: Systematic bias toward less sustainable items detected
  • Nutritional signals: Models under-represent vitamins and potassium while showing mixed results for sugars and fats

Key Finding: FoodNexus enables richer, nutrition-sensitive evaluation of recommendations, revealing biases that would be invisible with traditional datasets.

Key Contributions

  • Large-Scale Knowledge Graph: Nearly one billion triples linking recipes, products, users, and nutritional information
  • User Trait Extraction: Novel pipeline using LLMs to extract dietary preferences and health goals from user content
  • Recipe-Product Linking: Semantic matching to connect recipes with real food products for nutritional analysis
  • Nutrition-Aware Evaluation: Framework for assessing recommendation fairness regarding health and sustainability
  • Schema.org Alignment: High interoperability with existing semantic web standards

Resources

The FoodNexus dataset and code are publicly available:

BibTeX

@inproceedings{boratto2026foodnexus,
  author = {Boratto, Ludovico and Fenu, Gianni and Marras, Mirko and Medda, Giacomo and Zedda, Giovanni},
  title = {FoodNexus: Massive Food Knowledge for Recommender Systems},
  booktitle = {Proceedings of the 48th European Conference on Information Retrieval},
  series = {ECIR '26},
  year = {2026},
  publisher = {Springer},
  doi = {10.1007/978-3-032-21321-1_52},
  url = {https://doi.org/10.1007/978-3-032-21321-1_52}
}