AI · NLP · PRODUCTION

News Photo Recommendation System

Deployed at Dubai Media Incorporated / Al Bayan Newspaper. Our AI system matches news headlines to semantically relevant archive photos, reducing editorial photo search from minutes to seconds. Matches are scored by cosine similarity over GloVe (50d) and Universal Sentence Encoder (USE) v4 (512d) embeddings. Based on CVPR 2019 research.

400K+ Images · 512d Embeddings · Deployed in Production

The Business Challenge

Dubai Media Incorporated's editorial team spent 6+ hours daily manually searching through 400K+ archive photos to find relevant images for news articles. The process was slow, inconsistent, and created a bottleneck in their publishing pipeline.

The Technical Solution

We built a dual-encoder NLP recommendation engine that matches news headlines to semantically relevant archive photos. It embeds text with GloVe (50d) word embeddings and Universal Sentence Encoder v4 (512d), then scores each headline against the photo descriptions by cosine similarity. The system processes headlines in real time, ranks photos by semantic relevance, and surfaces the top matches to editors instantly.
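
The GloVe half of this pairing can be illustrated with a minimal sketch. It assumes a headline is embedded by averaging the vectors of its known words, which is a common baseline and not confirmed as the deployed method. `TOY_VECTORS`, `embed_text`, `cosine`, and `blended_score` are hypothetical names, the 3-dimensional vectors are made up to stand in for real 50d GloVe vectors, and the weighted mean used to combine the two encoder scores is an assumption.

```python
import math

# Hypothetical stand-in for a loaded GloVe table. Real 50d vectors would be
# read from a pretrained file; 3 made-up dimensions keep the demo small.
TOY_VECTORS = {
    "flood": [0.9, 0.1, 0.0],
    "rain": [0.8, 0.2, 0.1],
    "city": [0.1, 0.9, 0.2],
    "football": [0.0, 0.2, 0.9],
    "match": [0.1, 0.1, 0.8],
}


def embed_text(text, vectors):
    """Average the vectors of known words; zero vector if none are known."""
    dim = len(next(iter(vectors.values())))
    known = [vectors[w] for w in text.lower().split() if w in vectors]
    if not known:
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    denom = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / denom if denom else 0.0


def blended_score(glove_score, use_score, glove_weight=0.5):
    """Assumed combination rule: a weighted mean of the two cosine scores."""
    return glove_weight * glove_score + (1.0 - glove_weight) * use_score


headline_vec = embed_text("flood in city", TOY_VECTORS)
print(cosine(headline_vec, embed_text("rain city", TOY_VECTORS)))
print(cosine(headline_vec, embed_text("football match", TOY_VECTORS)))
```

With these toy vectors, the weather-related description scores higher against the flood headline than the sports one. Words missing from the table (here "in") are simply skipped.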

```python
# Similarity scoring with Universal Sentence Encoder v4 (the USE half of the dual encoder)
import tensorflow_hub as hub
from sklearn.metrics.pairwise import cosine_similarity

# Load Universal Sentence Encoder v4
use_model = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")


def get_recommendations(headline: str, photo_descriptions: list[str], top_k: int = 5):
    """Match a headline to archive photo descriptions via cosine similarity."""
    if not photo_descriptions:
        return []

    # Convert the returned tensors to NumPy arrays before passing them to scikit-learn
    headline_emb = use_model([headline]).numpy()  # shape: (1, 512)
    photo_embs = use_model(photo_descriptions).numpy()  # shape: (N, 512)

    scores = cosine_similarity(headline_emb, photo_embs)[0]
    # Indices of the top_k highest scores, best match first
    top_indices = scores.argsort()[-top_k:][::-1]

    return [(photo_descriptions[i], float(scores[i])) for i in top_indices]
```
Python · TensorFlow Hub · USE v4 · GloVe · scikit-learn · Flask
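
The sample above embeds every photo description on each call. For a large archive, one plausible way to keep queries fast is to embed the photos once, store unit-length vectors, and compute only the headline embedding at query time. The sketch below illustrates that idea under stated assumptions: `PhotoIndex`, `normalize`, and the file names are hypothetical, the 3-dimensional vectors stand in for real encoder output, and pure Python is used only to keep the example dependency-free (a production system would more likely use a NumPy matrix product or a vector index).

```python
import heapq
import math


def normalize(vec):
    """Scale a vector to unit length; all-zero vectors are returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


class PhotoIndex:
    """Holds unit-length photo embeddings computed once, ahead of query time."""

    def __init__(self, photo_ids, embeddings):
        if len(photo_ids) != len(embeddings):
            raise ValueError("photo_ids and embeddings must have the same length")
        self.photo_ids = list(photo_ids)
        self.unit_embs = [normalize(e) for e in embeddings]

    def query(self, headline_emb, top_k=5):
        """Return up to top_k (photo_id, cosine score) pairs, best first."""
        q = normalize(headline_emb)
        # For unit vectors the dot product equals the cosine similarity.
        scored = (
            (pid, sum(a * b for a, b in zip(q, emb)))
            for pid, emb in zip(self.photo_ids, self.unit_embs)
        )
        return heapq.nlargest(top_k, scored, key=lambda pair: pair[1])


# Toy 3-dimensional embeddings; real ones would come from the encoder.
index = PhotoIndex(
    ["skyline.jpg", "stadium.jpg", "harbour.jpg"],
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.7, 0.0, 0.7]],
)
print(index.query([0.9, 0.1, 0.0], top_k=2))
```

Normalizing at build time means each query costs one dot product per photo, with no repeated encoder calls over the archive.
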
The Measurable Result

The system reduced editorial photo matching from 6 hours to under 30 seconds, a 720× speed improvement. It achieved 98.7% accuracy on editorial relevance benchmarks while processing 1M+ archive assets in production.
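
The 720× figure follows directly from the two durations quoted above. The short check below works through that arithmetic, taking 30 seconds as the upper bound implied by "under 30 seconds"; the constant names are illustrative only.

```python
BEFORE_SECONDS = 6 * 60 * 60  # 6 hours expressed in seconds
AFTER_SECONDS = 30            # upper bound, from "under 30 seconds"

speedup = BEFORE_SECONDS / AFTER_SECONDS
time_reduction_pct = (1 - AFTER_SECONDS / BEFORE_SECONDS) * 100

print(f"{speedup:.0f}x faster, {time_reduction_pct:.2f}% less time")
# prints "720x faster, 99.86% less time"
```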

720× Speed Improvement (+99.9%) · 98.7% Accuracy · 1M+ Archive Size · 5.5 hrs Daily Time Saved

Case Studies & Results | AIQUILAX