Convert File Formats -
Convert Archive, Audio, Caption, Computer Aided Design, Data, Document, eBook, Email, Feed, Font, Image, Localization, Presentation, Project and Task, Spreadsheet, Subtitle, Syndication, Vector and Video files.
Supported File Types
Language Studio supports a wide range of file formats. Convert one or more files of the same type to another type by simply dragging and dropping (or uploading) your files and selecting the output format.
New file types are added on a regular basis. The below file type categories are supported in the current release:
- Packing: 7z, XZ, BZIP2, GZIP, TAR, ZIP and WIM
- Unpacking only: 7z, XZ, BZIP2, GZIP, TAR, ZIP, WIM, APFS, AR, ARJ, CAB, CHM, CPIO, CramFS, DMG, EXT, FAT, GPT, HFS, IHEX, ISO, LZH, LZMA, MBR, MSI, NSIS, NTFS, QCOW2, RAR, RPM, SquashFS, UDF, UEFI, VDI, VHD, VHDX, VMDK, WIM, XAR and Z.
|7Z||7-Zip Compressed File|
|ARJ||Archiver Robert Jung|
|BZIP2||Burrows–Wheeler algorithm, compressed archive file|
|CHM||Compiled HTML Format|
|FAT||File Allocation Table|
|HFS||Hierarchical System Format|
|ISO||International Organization for Standardization|
|LHZ||Lempel-Ziv-Huffman, compressed archive file|
|LZMA||Lempel-Ziv-Markov chain Algorithm, compressed archive file|
|NSIS||Nullsoft Scriptable Install System, compressed archive file|
|NTFS||New Technology File System, compressed archive file|
|RAR||WinRAR Compressed Archive|
|RPM||Red Hat Package Manager|
|TAR||Tape Archive File|
|UDF||Universal Disk Format File|
|WIM||Windows Imaging Format|
|ZIP||Compressed Archive File|
An audio file is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or compressed to reduce the file size (often using lossy compression). The data can be a raw bitstream in an audio coding format, but it is usually embedded in a container format or an audio data format with defined storage layer.
Convert between audio formats or extract audio from any video file.
|AAC||Advanced Audio Coding|
|AC3||AC3 Audio File|
|AIFF||Audio Interchange File Format|
|ALAC||Apple’s Lossless Audio Codec|
|AU||Sun Microsystems AU File Format|
|AVR||Audio Visual Research File Format|
|CAF||Core Audio Format|
|DSD||Direct Stream Digital|
|FAP||PARIS Audio File|
|FLAC||Free Lossless Audio Codec|
|GSM||GSM 06.10 Lossy Speech Compression|
|M4A||Compressed Audio File Format|
|M4R||MPEG 4 Ringtones Audio|
|MP2||MPEG-1/2 Audio Layer 2|
|MP3||MP3 Audio Format|
|NIST||NIST (National Institute of Standards and Technology) File Format|
|OGG||Ogg Vorbis Compressed Audio Fiel|
|VOC||Sound Blaster VOC File Format|
|WMA||Windows Media Audio|
|WAV||Waveform Audio File Format|
|WV||WavPack Lossless Audio Compression|
Computer-Aided Design (CAD)
Computer Aided Design (CAD) is a form of digital design that allows users to create, modify and manage 2D and 3D models. A CAD file contains a blueprint, schematic, technical drawing or 2D/3D rendering of an object. CAD data exchange file formats are used to exchange drawing data and to translate between different Computer-aided design (CAD) authoring systems or between CAD and other downstream CAx systems. Many companies use different CAD systems and exchange CAD data files with suppliers, customers, and subcontractors.
There are many different file formats available for CAD files, but the most common formats are DWG (AutoCAD), DXF (AutoCAD), IGES (Inventor), STL (3D Systems) and SVG (Scalable Vector Graphics).
A data file is a computer file which stores data to be used by a computer application or system, including input and output data. Many computer applications use XML and JSON data files. For simple data storage, CSV or TSV are also a very common formats that are easily imported into a spreadsheet tool such as Microsoft Excel. The most simple data file is a TXT (plain text) file where the data is not structured and is simple text only.
A document file contains formatted information that includes layout, fonts, styles (bold, italic, underline, etc.), page margins, headings, bullets and numbers, diagrams, images, etc. Some document formats are used in word processors such as Microsoft Word or Google Docs, while others such as PDF (Portable Document Format) are intended for publication and are not normally editable.
Formats such as PDF can be challenging to convert to editable formats such as Microsoft Word as all the editing information has been removed and therefore must be reverse engineered in order to produce an editable document. Language Studio uses AI to determine the structure of table borders, page margins, headers, footers, and other document structural elements.
Other Microsoft Office formats are also supported, including Presentations and Spreadsheets.
|CHM||Compiled HTML Format|
|DOC||Microsoft Word DOC Format|
|DOCM||Microsoft Word DOC Format Macro enabled|
|DOCX||Microsoft Word DOCX Format|
|DOT||Microsoft Word DOC Template files|
|DOTX||Microsoft Word DOCX Template files|
|HTML||Hypertext Markup Langauge|
|ODT||OpenDocument Text File format|
|OTT||Template Documents in compliance with OASIS’ OpenDocument Standard Format|
|PCL||Printer Command Language|
|Portable Document Format|
|Portable Document Format -Standard|
|PS||PostScript (a general purpose page description langauge)|
|PUB||Microsoft Publisher Document|
|RTF||Rich Text Format|
|TEX||LaTeX Document Format|
|WP/WP6/WP7||Word Perfect Format|
|WPT||Word Perfect Template|
|XHTML||Text Based File with Markup in XML|
|XPS||XML Paper Specification|
An eBook (short for electronic book), also known as an e-book or an ebook, is a book publication made available in digital form, consisting of text, images, or both, readable on the flat-panel display of a computer or other electronic device. Although sometimes defined as “an electronic version of a printed book”, some eBooks exist without a printed equivalent. eBooks can be read on dedicated e-reader devices, but also on any computer device that features a controllable viewing screen, including desktop computers, laptops, tablets and smartphones.
|EPUB||Electronic Publisher Format|
Email is a common means of communication for businesses and individuals alike and Email clients are used to send and receive email messages, as well as manage other aspects of your email account. The email clients use a variety of email file formats to store their various data including email messages, attachments, folders and address books, etc.
There are several common email file types. The most common type of file extension associated with email is EML (Outlook Express Message File). PST (Microsoft Outlook’s Personal Storage Table) is a popular format that stores many emails in one place for quick and retrieval. Emails may be stored with attachments embedded and are typically formatted as HTML and plain text within the format of each specific file type.
Feed / Syndication
A web/news feed or syndication is a data format used for providing users with frequently updated content. Content distributors syndicate a web feed, allowing users to subscribe a channel to it by adding the feed resource address to a news aggregator client / feed reader / news reader.
|ANPA||IPTC ANPA News Wire feed format|
|ATOM||Atom Syndication Format|
|RSS||Rich Site Summary|
Fonts are a crucial part of the way we interact with computers. The file formats that allow us to store and access fonts have been improving steadily over time, but the latest versions are truly impressive. Modern fonts can handle complex scripts like Arabic and Hebrew, support multiple languages at once (like English plus Chinese), provide more accurate metrics for shaping glyphs based on their intended usage (for example in small sizes or large sizes), and they can also have multiple colors.
Popular file formats include TTF (TrueType Font), OTF (OpenType Font), CFF (CFF Font), and EOT (Embedded OpenType).
Image file types are classified into raster image formats and vector image formats. The files in this category are raster images – digital images that comprise of pixels data for representation of colors. There are two main types of raster image formats – lossless and lossy. Lossless means that no information is lost when the file is opened and viewed — the same as it was when it was first created; whereas lossy means that some information has been lost due to compression algorithms being applied during the creation process (which decreases file size but which may also decrease image quality in some cases). These are the most common image types for graphics that are used for the web as well as for digital photos.
Common raster image file formats include BMP (Bitmap image file), PNG (Portable Network Graphics), GIF (Graphics Interchange File) and JPG (Joint Photographic Experts Group).
|APNG||Animated Portable Network Graphics|
|BMP||Bitmap Raster Image|
|DCM||DICOM (Digital Imaging and Communications in Medicine image)|
|DIB||Device-Independent Bitmap Raster Image|
|DJVU||Graphics File Format|
|DNG||Digital Negative Graphics|
|DXF||Drawing Exchange Format|
|GIF||Graphical Interchange Format|
|JPEG/JPG||Joint Photographic Experts Group|
|ODG||Open Document Graphics|
|OTG||Open Document Drawing Template File Format|
|PNG||Portable Network Graphic|
|PSD||Photo Shop Document|
|TGA||Raster Graphic Format by Truevision Inc|
|TIF/TIFF||Tagged Image File Format|
|WEBP||Modern Raster Web Image|
Localization file formats are designed for two very distinctive roles and purposes – to assist translation professionals in translation by leveraging previous translations or pre-defined terminology, and as a localization resource file where a user can switch between locales/languages within an application.
Translation memories are used to recall previous translations and match against new strings to be translated and also to share translation files between users or when outsourcing content localization. The most common file format for translation memories in XLIFF (XML Localization Interchange Format), with variants created by a number of vendors that have their own vendor specific extensions. TBX (Term Base Exchange) is the most common terminology exchange format. All CAT (Computer Aided Translation) tools will support XLIFF. Many will also support still the legacy TMX (Translation Memory Exchange) format, although this is now a defunct standard.
There are many application localization files with many formats that are specific to the programming language in which they were written. Popular file formats include PO (Portable Object), POT (Portable Object Template), YAML (Yet Another Markup Language / YAML Ain’t Markup Language) and RESX (.NET Managed Resources File).
|MQLXLIFF||memoQ XML Localization Interchange Format (XLIFF Variant)|
|SDLXLIFF||SDL XML Localization Interchange Format (XLIFF Variant)|
|TBX||Term Base Exchange|
|TMX||Translation Memory Exchange|
|XLIFF/XLF||XML Localization Interchange Format|
Presentation files are a type of document that stores information regarding the look and feel of slides, animations, and other presentation data. Presentation files also store important information such as: text, embedded objects, audio/video files and shapes.
The most common presentation file file format is PPTX (Microsoft Powerpoint). Many presentations are also converted into PDF (Portal Document Format) when they need to be shared with others as a published document.
Other Microsoft Office formats are also supported, including Documents and Spreadsheets.
|POT||Microsoft PowerPoint 97-2003 Template|
|POTM||Microsoft PowerPoint Template with Macros Enabled|
|POTX||Microsoft PowerPoint Template|
|PPS||PowerPoint Slide Show 97/2000/XP|
|PPSM||PowerPoint Slide Show with Macros Enabled|
|PPSX||PowerPoint Slide Show|
|PPT||Microsoft PowerPoint 97/2000/XP|
|PPTM||Microsoft PowerPoint with Macros Enabled|
Project and Task
Project files are an essential part of any project management system. Files are used to store tasks, resources, and scheduling to get a measurable output in the form of a product or a service. The most common file format for project files is MPP from Gantt charts can be created and exported as PDFs or images for documentation purposes. Additional file formats include MPX which is used by Microsoft Project and XER which is used by Excel.
Spreadsheets are one of the most versatile data-handling tools available. With a spreadsheet, you can organize and analyze your data in many different ways, including tables, graphs and charts. Spreadsheets are a great way to organize your data, but sometimes you need to save your spreadsheet in a different file format.
The most common file extension for spreadsheets is XLSX (Microsoft Excel Open XML Spreadsheet), but there are other options as well. Other common extensions include ODS (OpenDocument Spreadsheet), XLS (Microsoft Excel Binary File Format) and CSV (Comma Separated Value).
Other Microsoft Office formats are also supported, including Presentations and Documents.
|OTS||Open Document Spreadsheet Template File Format|
|XLS||Microsoft Excel XLS Format|
|XLSB||Microsoft Excel Binary Spreadsheet|
|XLSX||Microsoft Excel XLSX Format|
Subtitle / Caption
Subtitles and captions are lines of dialogue or other text displayed at the bottom of the screen in films, television programs, video games or other visual media. Subtitles refer to a text translation of audio into a different language and are for people who can hear the audio, but may not be able to understand the dialogue. Captions are text in the language of the audio and are designed for anyone unable to hear the audio, they often also contain descriptions of important sounds that would not normally be perceived by anyone who is unable to hear the audio.
The most common file type for subtitles in the SRT (SubRip Text Format) format. However, TTML (Timed Text Markup Language) is the dominant standard with more advanced editors and newer applications.
|DFXP||Distribution Format Exchange Profile|
|PAC||Screen Subtitling Systems PAC Format|
|SCC||Scenarist Closed Captions|
|SRT||SubRip Text Format|
|TTML||Timed Text Markup Language|
|VTT||Web Video Text Tracks (WebVTT)|
Vector graphics are different because they don’t use pixels at all (unlike raster / bitmap graphics)—they’re made up entirely of mathematical calculations called vectors instead! This means that they can scale. Vector images are stored as mathematical formulas rather than pixelated pieces of color. Images are created directly from geometric shapes defined on a Cartesian plane, such as points, lines, Curves and polygons.
Vector graphics are often used for illustrations, graphic design , computer-aided design, and geographic information systems, but can also be used to represent photographs and other pictures. Common file types include AI (Adobe Illustrator Artwork) , PSD (Adobe Photoshop Document) and SVG (Scalable Vector Graphics).
|AI||Adobe Illustrator Artwork|
|CMX||Corel Metafile Exchange|
|EMF||Enhanced Windows Metafile|
|PSB||Photoshop Large Document Format|
|PSD||Adobe Photoshop Document|
|SVG||Scalable Vector Graphics|
|SVGZ||Compressed Version of Scalable Vector Graphics|
|WMF||Microsoft Windows Metafile|
|WMZ||Compressed Version of WMF|
Video file formats are used by computers to store digital media. Because of the increasing popularity of video content, a number of different types of video file formats are available for use. There are many different factors, including file size and quality, that can assist in determining which format you should use for your videos.
The most popular video formats are AVI (Audio Video Interleave), MOV (QuickTime Movie), WMV (Windows Media Video), M4V (MPEG-4) and MP4 (MPEG-4). All these formats support high definition videos with different quality levels ranging from 480p to 1080p and above.
A video can also be converted into an audio file format that contains the audio only from the original video.
|AVI||Audio Video Interleave|
|F4V||Flash video formats|
|FLV||Flash video formats|
|HTML5||AudioVisual media file format|
|M2TS||MPEG-2 Transport Stream|
|MKV||Matroska Multimedia Container|
|MP4||MPEG-4 Part 14|