HiCellTek HiCellTek
Back to blog
Video MOSITU-T P.1204.3QoEVideo

Mobile video quality assessment with ITU-T P.1204.3

How ITU-T P.1204.3 parametric video MOS model works for mobile video QoE assessment. Bitstream analysis, no-reference scoring, and integration for 5G network monitoring.

Takwa Sebai
Takwa Sebai
Founder & CEO, HiCellTek
March 8, 2026 Β· 5 min read

Video traffic now dominates mobile networks. On 5G networks, video streaming accounts for the majority of downlink data volume, and user expectations for quality are higher than ever. Yet most drive test and network monitoring tools still treat video as a throughput problem β€” measuring download speed rather than the actual quality perceived by the viewer.

ITU-T P.1204.3 changes this. It is a standardized parametric model that estimates video MOS (Mean Opinion Score) directly from the video bitstream, without needing a reference video. This article explains how it works, why it matters for mobile operators, and how to deploy it in the field.

Why video QoE measurement matters on mobile

Traditional network KPIs β€” throughput, latency, packet loss β€” are necessary but insufficient for understanding video quality. A user streaming a 1080p video on a 5G connection may experience:

  • Rebuffering due to intermittent radio conditions, even with high average throughput
  • Resolution downscaling triggered by adaptive bitrate (ABR) algorithms reacting to bandwidth fluctuations
  • Codec artifacts from aggressive compression at low bitrates
  • Initial loading delay caused by TCP slow start or DNS resolution

These degradations directly impact the viewer’s perceived quality, but they do not appear in standard RF KPI dashboards. A video MOS score captures the aggregate impact of all these factors into a single, interpretable metric on the familiar 1-to-5 scale.

How ITU-T P.1204.3 works

P.1204.3 is part of the ITU-T P.1204 family of video quality models. Specifically, it is the mode 3 (bitstream) model, meaning it analyzes the encoded video bitstream to estimate quality β€” it does not require pixel-level comparison with a reference video.

Input data

The model takes as input:

  • Video codec metadata: codec type (H.264, H.265/HEVC, VP9, AV1), profile, level
  • Frame-level statistics: frame size, frame type (I/P/B), quantization parameter (QP)
  • Temporal information: frame rate, frame drops, rebuffering events
  • Resolution: spatial resolution and any mid-stream resolution changes (ABR switching)

All of this information is available from the video decoder without requiring access to the original uncompressed video.

Scoring process

  1. Per-frame quality estimation: the model estimates the visual quality of each frame based on its QP, codec, and resolution
  2. Temporal pooling: frame-level scores are aggregated over time, with higher weight given to quality drops (the human visual system is more sensitive to degradation than to improvement)
  3. Rebuffering penalty: stalling events are heavily penalized, as rebuffering is the single most impactful factor on video QoE
  4. Final MOS output: a score from 1.0 (unwatchable) to 5.0 (excellent), calibrated against subjective test data

Key advantages of the bitstream approach

PropertyP.1204.3 (bitstream)Full-reference (e.g., VMAF)
Reference video neededNoYes
CPU costLow (metadata only)High (pixel comparison)
Real-time capableYesDifficult on mobile
Codec-awareYesYes
Rebuffering detectionYesVaries
StandardizationITU-TNetflix (open-source)

The no-reference, low-CPU design makes P.1204.3 uniquely suited for mobile deployment. It can run alongside the video player on the device without degrading playback performance.

Use cases for mobile operators

1. QoE-driven network optimization

Instead of optimizing for peak throughput, operators can optimize for video MOS. This shifts the focus to what matters: consistent quality at the bitrates users actually consume. A cell sector with 200 Mbps peak but frequent micro-interruptions may score worse on video MOS than a sector delivering stable 30 Mbps.

2. Video streaming benchmarking

Operators running competitive benchmarks can measure video MOS alongside traditional KPIs. This provides a direct, comparable metric for β€œwhich operator delivers the best YouTube/Netflix/TikTok experience in this city.”

3. ABR algorithm validation

When operators deploy or tune ABR policies on their video optimization proxies, P.1204.3 provides objective feedback on whether the changes actually improved perceived quality or introduced unnecessary resolution switching.

4. 5G service differentiation

5G operators positioning premium video tiers (4K, low-latency gaming streams) need QoE metrics to validate that the premium slice or QoS policy delivers measurably better video quality than best-effort traffic.

Integration architecture

A practical P.1204.3 deployment on Android involves the following components:

  1. Video player integration: hook into the media pipeline (e.g., Media3/ExoPlayer) to capture frame-level statistics, codec metadata, and rebuffering events
  2. P.1204.3 scoring engine: process the collected metadata through the ITU-T model to produce per-segment MOS scores
  3. RF context tagging: correlate each MOS score with the concurrent radio conditions β€” serving cell, RSRP, SINR, band, NR/LTE anchor β€” to enable root-cause analysis
  4. Data export: write MOS time series to the log file alongside Layer 3 and RF data for post-processing

This architecture ensures that a low video MOS score can be traced back to a specific cell, a specific time window, and a specific RF condition β€” actionable information for the optimization team.

Interpreting video MOS scores

MOS rangePerceived qualityTypical cause
4.5 - 5.0ExcellentStable high bitrate, no stalling
3.5 - 4.5GoodMinor ABR switching, no stalling
2.5 - 3.5FairFrequent resolution drops or short stalls
1.5 - 2.5PoorMultiple rebuffering events, low resolution
1.0 - 1.5UnwatchableContinuous stalling or playback failure

Operators should target a minimum video MOS of 3.5 for mainstream streaming services. For premium 5G tiers, a threshold of 4.0+ is appropriate.

How HiCellTek implements video MOS

HiCellTek’s MOS Video SDK integrates ITU-T P.1204.3 with Media3/ExoPlayer on Android. The SDK hooks into the video decoder pipeline to capture frame statistics in real time, runs the P.1204.3 model on-device, and produces per-segment video MOS scores synchronized with the RF measurement timeline.

No server-side processing is required. No reference video is needed. The video MOS scores are written to the same HLOG file as Layer 3 messages and RF KPIs, enabling unified post-analysis in the HiCellTek desktop tool.

Conclusion

Video QoE is no longer optional for mobile network operators β€” it is the primary experience that end users judge their service by. ITU-T P.1204.3 provides a standardized, lightweight, no-reference method to quantify video quality directly on the device, in real time, during actual streaming sessions.

HiCellTek’s MOS Video SDK delivers P.1204.3 integration for Android at 4,490 EUR/year, including Media3/ExoPlayer hooks, on-device scoring, and RF context correlation. Explore the full capabilities and licensing on the pricing page.

Share: LinkedIn X
Takwa Sebai
Takwa Sebai

Founder of HiCellTek. 15+ years in telecom, operator side, vendor side, field side. Building the field tool RF engineers deserve.

Ready for the field?

Request a personalized demo of HiCellTek β€” 2G/3G/4G/5G network diagnostics on Android.

Try our free telecom tools

TAC Lookup, IMEI Calculator, EARFCN Calculator, used by telecom engineers worldwide.

Try Free Tools

Get telecom engineering insights. No spam, ever.

Unsubscribe in one click. Data processed in the EU.