# Locale-XGettext Extract strings from arbitrary formats into PO files ## Description When using [GNU gettext](https://www.gnu.org/software/gettext/) you often find yourself extracting translatable strings from more or less exotic file formats that cannot be handled by xgettext from the [GNU gettext](https://www.gnu.org/software/gettext/) suite directly. This package simplifies the task of writing a string extractor in Perl, Python, Java, Ruby or other languages by providing a common base needed for such scripts. ## Usage Included is a sample string extractor [xgettext-txt](bin/xgettext-txt) for plain text files. It simply splits the input into paragraphs, and turns each paragraph into an entry of a PO file. ## Common Workflow The idea of the package is that you just a write a parser plug-in for `Locale::XGettext` and use all the boilerplate code for generating the PO file and for processing script options from this library. One such example is a parser plug-in for strings in templates for the Template Toolkit version 2 included in the package [Template-Plugin-Gettext](https://github.com/gflohr/Template-Plugin-Gettext). that contains a script `xgettext-tt2` which can only extract strings from that particular template language. If this is the only source of translatable strings you are mostly done. Often times you will, however, have to merge strings from all different input formats into one single PO file. Let's assume that your project is written in Perl and C and that it also contains Template Toolkit templates and plain text files that have to be translated. 1. Use `xgettext-txt` from this package to extract strings from all plain text files and write the output into `text.pot`. 2. Use `xgettext-tt2` from [Template-Plugin-Gettext](https://github.com/gflohr/Template-Plugin-Gettext) to extract all strings from your templates into another file `templates.pot`. 3. Finally use `xgettext` from [GNU gettext](https://www.gnu.org/software/gettext/) for extracting strings from all source files written in Perl and C, _and_ from the previously created pot files `text.pot` and `templates.pot`. This works because `xgettext` natively understands `.po` resp. `.pot` files. By the way, all xgettext flavors based on `Locale::XGettext` are also able to extract strings from `.po` or `.pot` files. So you can also make do completely without GNU gettext and use any `Locale::XGettext` extractor instead of GNU gettext for the last step. ## Writing Extractors Writing an extractor is as easy as implementing one single method that takes a filename argument and extract strings from that file. See the manual page [Locale::XGettext(3pm)](http://search.cpan.org/~guido/Locale-XGettext/lib/Locale/XGettext.pm) for more information. See [samples/README.md](samples/README.md) as a starting point for writing an extractor in Perl or many other languages. The distribution currently contains fully functional examples written in [C](samples/C/README.md), [Java](samples/Java/README.md), [Python](samples/Python/README.md), [Perl](samples/Perl/README.md), and [Ruby](samples/Ruby/README.md). ## Differences To `xgettext` From GNU Gettext There a couple of subtle differences in the handling of command-line arguments between extractors based on `Locale::XGettext` and the original `xgettext` program. Report a bug if you think that a particular difference is a bug and not an improvement. One thing that `Locale::XGettext` does not support is the prefix "pass-" for flag definitions. While it is possible for an extractor to implement the behavior of GNU gettext, this is not directly supported by `Locale::XGettext`. Instead, that prefix is simply ignored, when specified on the command-line for an option argument to "--flag" or as part of the set of default flags for a particular extractor. Additionally, while `xgettext` from GNU gettext has a hard-coded, fixed set of supported formats, you can specify arbitrary formats with "--flag" for extractors based on `Locale::XGettext`. ## Installation ### From CPAN You can install the latest version of `Locale::XGettext` from [CPAN](http://search.cpan.org/) with: ``` $ cpan Locale::XGettext ``` If the command `cpan` is not installed, try instead: ``` $ perl -MCPAN -e 'install Locale::XGettext' ``` ### From Sources Download the sources from [Locale-XGettext](https://github.com/gflohr/Locale-XGettext) and ``` $ tar cxf Locale-XGettext-VERSION.tar.gz $ cd Locale-XGettext-VERSION $ perl Makefile.PL $ make $ make test $ make install ``` ### From Git ``` $ git clone https://github.com/gflohr/Locale-XGettext.git $ cd Locale-XGettext $ dzil build $ cd Locale-XGettext-VERSION ``` From here, follow the instructions for installation from sources. The command `dzil` is part of [Dist::Zilla](http://search.cpan.org/~rjbs/Dist-Zilla/). ## TODO The module should ship with its own PO parser and writer. ## Bugs Please report bugs at [https://github.com/gflohr/Locale-XGettext/issues](https://github.com/gflohr/Locale-XGettext/issues) ## Copyright Copyright (C) 2016-2017, Guido Flohr, <guido.flohr@cantanea.com>, all rights reserved.