Sketch Engine Localisation
The Sketch Engine interface can be translated from English into any other language. To to this, we need to translate all the interface strings into the particular language.
The strings are stored in a file ske.po that is generated automatically from the program code to keep it up-to-date. It is a simple text file and the translator needs to fill in the translations: the original English term is referred as 'msgid' and the translations belong between the quotes after 'msgstr'. Everything should be in UTF-8. When finished, the translator should send the resulting file to the developers ( support@sketchengine.co.uk) so that it could be added into the system.
How to find the particular string in the interface
At each translation string in the ske.po there is the name of the file where the string occurs in the system -- e.g. "wsdiff_form.py" -- which is a guideline for where to find the string in the interface. The filename is usually an abbreviation of the function name as it occurs in the interface.
There are some strings that are not worth looking for in the interface because they are rare or do not occur at all because they are either rare program exceptions or come from some specialized applications of the Sketch Engine that regular users do not have access to. In such cases, we suggest skipping the string (the English term will then remain untranslated in the interface) or provide a default translation such as "probably OK" just in case... Such strings include:
msgid "Undefined lemma PoS" msgid "Undefined word form PoS" msgid "Select Positive or Negative filter type" msgid "Evaluation error." msgid "No text type selected" msgid "No output attribute specified" msgid "Empty subcorpus!" msgid "No meta-information to create a subcorpus." msgid "Unknown action" msgid "Insufficient data for" msgid "Insufficient data for id" msgid "Add next" msgid "label" msgid "leaf" msgid "Error Code" msgid "Correction Code" msgid "Incorrect word(s)" msgid "Corrected word(s)" msgid "Error Query" msgid "Search in" msgid "Search the corpus" msgid "name" msgid "Word Sketch Evaluation" msgid "Sketch-Eval" msgid "WS Text-type diff." msgid "Line Group Info" msgid "Annotating" msgid "Automatically annotated" msgid "hits" msgid "Minscore" msgid "Mindiff" msgid "View Word Sketch Evaluation" msgid "View Word Sketch for Evaluation" msgid "Sorry, you are not allowed to evaluate Word Sketches on this corpus." msgid "Evaluation set" msgid "Available records" msgid "Record ID" msgid "Timestamp" msgid "Evaluation Settings" msgid "No. collocations" msgid "Min. freq" msgid "Min. salience" msgid "Word Sketch Patterns Entry Form" msgid "noun" msgid "verb" msgid "adjective" msgid "Store Concordance" msgid "Concordance stored" msgid "Concordance name" msgid "Store Concordance For Annotation" msgid "Select stored concordance" msgid "Selected" msgid "Line group sort" msgid "Successfully Stored" msgid "The following word sketch evaluation was successfully stored" msgid "lemma" msgid "testing set" msgid "filename" msgid "Next Evaluation" msgid "Sorry, creating user subcorpora is not available in the open version" msgid "New pattern" msgid "Number globally" msgid "Number selected" msgid "Info" msgid "Finish" msgid "Clear sel" msgid "display whole document" msgid "Word Sketch for Evaluation Entry Form" msgid "Preloaded evaluation sets:" msgid "Show Word Sketch for Evaluation" msgid "Number of collocations" msgid "Max number of collocates" msgid "per gramrel" msgid "Show salience scores" msgid "Relevant gramrels" msgid "Deselect All" msgid "Select gramrels" msgid "Word Sketch for Evaluation" msgid "Working with existing record" msgid "" "Warning: Another evaluation record for this lemma was found in the database." msgid "Show available records" msgid "Rubric" msgid "Good" msgid "Good but wrong grammatical relation" msgid "Maybe (not striking collocate)" msgid "Maybe (specialized vocab)" msgid "Bad" msgid "Collocation" msgid "Rating" msgid "Salience" msgid "Store Word Sketch Evaluation" msgid "displaying only" msgid "whole word sketch" msgid "New label" msgid "WS Text Type Diff Entry Form" msgid "Word Sketch Text Type Differences Entry Form" msgid "Text Type 1" msgid "Text Type 2" msgid "Tickbox Lexicography - Select Examples" msgid "logging..." msgid "" "(if this page is not redirected in a few seconds, please click the \"Back\" " "button)" msgid "Template" msgid "Sorry, you are not allowed to use the tickbox lexicography" msgid "Tickbox Lexicography template" msgid "Examples per collocate" msgid "Process >>"
Other strings may be difficult to find. Usually they are in titles of some links (popup help) or in applications that the translator is not familiar with. Here we provide a list of strings that were problematic for translators with a short explanation of where the particular string can be found (the list may be updated in the future):
> msgid "Collocation form" > msgid "Collocation candidates" > msgid "In the range from" > msgid "Show functions" on page "Collocation candidates" -- after creating a concordance and clicking "Collocations" > msgid "Showing page” > msgid "<< First" > msgid "< Previous" > msgid "Next >" in case of longer wordlists there is a simple paging in the bottom of the page > msgid "Subcorpus" everywhere you can select a subcorpus, e.g. in the Concordance form -> text types > msgid "View current concordance" > msgid "View concordance" these two are in the new design (beta.sketchengine.co.uk) instead of old "View" in the various concordance functions (e.g. freqs) > msgid "Show frequencies of Node forms" popup help to a link in the concordance submenu > msgid "ConcDesc" abbreviation for "Concordance description" -- gives the formal description of the concordance > msgid "info" subcorpus info (everywhere where you can see subcorpora) > msgid "unspecified" the default value of PoS in the concordance form > msgid "Phrase" > msgid "Word Form" > msgid "Concordance - First query form" in the concordance form (the third is the title) > msgid "Simple" > msgid "Expert options" in the concordance form (new version) > msgid "Set limit" in the frequency lists (e.g. concordance -> frequency -> node tags) > msgid "Header Fields" this lists the structure attributes (e.g. doc.author) of the corpus > msgid "Numbers stand for" switching between document/token counts in listing structure attributes > msgid "#" means "number of" eg. tokens, docs -- in listing structure attributes > msgid "freq" abbreviation of "frequency" -- in wsketch, thesaurus results > msgid "Sample form" concordance -> sample (page title) > msgid "Save Collocations" > msgid "Save Collocation Candidates" collocations list -> save > msgid "Concordance lines" > msgid ">> More" > msgid "other" new version -- result of using the "simple search" in the right upper box > msgid "of" (in view.py) this shows up as "(page) X of XYZ" > msgid "Search" > msgid "in" > msgid "Help" new version -- topbar > msgid "View options" > msgid "Shuffle" concordance submenu > msgid "Attributes" > msgid "Structures" > msgid "References" > msgid "Display attributes" > msgid "Number of lines to be sorted" after clicking "View options" > msgid "Concordance view settings" > msgid "Swap concordance view mode" > msgid "Sort according to the 'References' field" > msgid "Randomize ordering of lines" popup help to the concordance submenu in new version > msgid "Compute a list of all words" wordlist submenu popup help > msgid "Reference corpus" > msgid "Reference subcorpus" keywords result > "Subcorpora operations need more preprocessing. It will take a while, please " > "wait." > "Once the list has been computed, it will be stored, so will be immediately " > "available next time." > "same subcorpus will show you progress of the computation." This message appears when you create the subcorpus and then request a wordlist/keywords for this subcorpus for the first time > msgid "ARF" > msgid "ARF/mill" Appears in keywords result -- "mill" is a million (freq per million words) > msgid "Switch between clustered and unclustered word sketches" mouseover in wsketch submenu > msgid "word frequency" > msgid "cluster frequency" mouseovers in wsketch tables

