MASK OFFICIAL

See and Understand.

Translate text with masks and understand visuals with a vision-language model (VLM). Privacy-first and on-device.

Built for multilingual reading: comics, games, slides, handwritten notes, and charts.

Original

mask Result

Boom!

This prototype must be finished by next week.

What You Mask Is What You Get

Select only what matters. Translation appears in place without breaking visual continuity.

Traditional Screenshot Translation

  • Capture, upload, then switch windows
  • Text is detached from original context
  • Repeated actions slow down reading flow

mask Translation

  • Select a dialog area and translate in one step
  • Translation is blended back into the scene
  • Designed for uninterrupted reading

Core Technology Highlights

VLM Scene Understanding

Beyond OCR, mask understands visual context in comics, charts, and handwritten notes.

On-device Privacy

OCR and key visual processing run locally by default, so your images stay on your device.

Flexible Model Routing

Switch between OpenAI, Gemini, Qwen, DeepSeek, and Ollama depending on your speed and quality needs.
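
As a rough illustration of how routing like this could be configured, the Python sketch below picks a provider from a preference list. Everything in it is hypothetical: the Route type, the ROUTES table, the pick_provider function, and the preference order are placeholders for illustration, not mask's actual configuration or benchmark results. Ollama is the only provider treated as local here, since it runs models on your own machine; the others are treated as remote services in this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str  # one of the provider names listed above
    local: bool    # True if the model runs on this device (assumption of this sketch)


# Hypothetical user preferences, in priority order. These are placeholders,
# not measurements of which provider is faster or better.
ROUTES = {
    "speed": [Route("Ollama", local=True), Route("Gemini", local=False)],
    "quality": [Route("OpenAI", local=False), Route("Ollama", local=True)],
}


def pick_provider(need: str, allow_remote: bool = False) -> str:
    """Return the first preferred provider that respects the privacy setting."""
    for route in ROUTES[need]:
        if route.local or allow_remote:
            return route.provider
    raise LookupError(f"no provider available for {need!r}")


print(pick_provider("quality"))                     # Ollama
print(pick_provider("quality", allow_remote=True))  # OpenAI
```

With remote providers disabled, the sketch falls back to the first local entry, which matches a local-first default.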

Beyond Text, Better Visual Understanding

Traditional OCR reads text only. mask VLM reads text with scene context.

Scenario                   | Traditional Tool                       | mask (VLM)
Handwritten notes + sketch | Fragmented words, awkward output       | Understands intent and generates coherent translation
Comic sound effects        | Misses text or outputs garbled content | Interprets “ドカン” and translates to “Boom!”
Data charts                | Translates title only                  | Summarizes chart trends in readable language

Privacy Without Compromise

Image Input → On-device OCR / VLM → Local Translation Output

Local-first processing pipeline with no image upload by default.
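
The flow above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not mask's implementation: run_pipeline, fake_ocr, fake_translate, and fake_upload are hypothetical names, and the OCR and translation steps are stubs standing in for on-device models.

```python
from typing import Callable, List, Optional

uploads: List[bytes] = []  # records every image that would leave the device


def fake_ocr(image: bytes) -> str:
    """Stub for an on-device OCR / VLM step (hypothetical)."""
    return "ドカン"


def fake_translate(text: str) -> str:
    """Stub for a local translation step (hypothetical)."""
    return {"ドカン": "Boom!"}.get(text, text)


def fake_upload(image: bytes) -> str:
    """Stub for a remote model call; reached only when explicitly allowed."""
    uploads.append(image)
    return "remote result"


def run_pipeline(
    image: bytes,
    ocr: Callable[[bytes], str],
    translate: Callable[[str], str],
    remote: Optional[Callable[[bytes], str]] = None,
    allow_upload: bool = False,  # local-first: uploading is opt-in
) -> str:
    """Process the image locally unless the caller opts in to a remote model."""
    if allow_upload and remote is not None:
        return remote(image)
    return translate(ocr(image))


result = run_pipeline(b"raw image bytes", fake_ocr, fake_translate, remote=fake_upload)
print(result)        # Boom!
print(len(uploads))  # 0, nothing was uploaded by default
```

Making the upload path opt-in means that forgetting a setting keeps the image on the device.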

Use Cases

Read Raw Comics

Translate dialog areas instantly while preserving original style.

Understand Foreign Slides

Translate text and capture chart meaning with context.

Support Foreign-language Games

Understand UI and story text in real time without context switching.

Organize Handwritten Notes

Interpret rough handwriting and sketches into readable output.

Early Feedback

“mask changed how I read foreign comics. It really understands sound effects.” - Early tester

Try mask now and turn what you see into what you understand.