30 December 2012

Open Source Multilingual Text Proofing with LanguageTool 2.0

Version 2.0 of LanguageTool, an Open Source style and grammar checker, has been released. It supports more than 20 languages, including English, German, Polish, French, and Spanish. The new version fixes some bugs and improves the error detection rules for several of the supported languages.

LanguageTool can be downloaded for free here.

26 December 2012

LT-Innovate News – LT Winners (and some Losers) in 2012

We have delivered over 1,800 news stories for you in the last 12 months. Everyone will have their own best-of list. Here’s a selection of our innovation highlights:


Happy New Year for 2013 with LT-Innovate!

14 December 2012

LT Innovate Weekly – Joining the News Dots – 9 to 14 December 2012

Here’s a customer view of the week’s LT news. Which European sectors were explicitly or implicitly addressed by announcements? You might be surprised!

Lawyers and jurists: Good Law is an ideas movement in the UK that aims to build better readability (including ‘plain’ language), accessibility, and discovery into legal documentation of all kinds. Lots of LT challenges in this text-intensive industry.

Industry-specific documentation managers. The Dutch-funded TEXsis  project has been looking at how to factor domain-specific terminology into the whole document translation process, from authoring through machine translation to editing and reading. In-domain terms are obviously key to customised understanding.

Biomed researchers and their kin: Linguamatics will be playing a key role in MANTRA, another project focused on multilingual terminologies to help mine knowledge simultaneously across multiple language repositories. Faster, more accurate knowledge discovery is a competitive asset in this sector.

Healthcare workers, researchers and patients: LTC has successively completed MORMED, a project that has built a platform for dedicated knowledge-sharing in the healthcare field around specific diseases. Seamless multilingual access to knowledge helps broaden and deepen the collaborative reach of everyone involved.

Business content searchers: TEMIS is to equip Bloomberg BNA’s vast repository of documentary knowledge used by business, legal and research experts. These people need to find the right stuff as quickly and accurately as possible, and also benefit from the “similarity” between content objects to deepen their understanding.

The literally speechless: The UK Creative Speech Technology Network  has released a film showing how various cases of speech production disorders can be aided by speech synthesis technology, including a TV comedian!

Hands-free experts who need to stay connected: Ikanos Consulting has added Ziggy, a speech-driven virtual assistant, to its wireless headset computer. People using these ruggedized headsets can keep in contact and search for information while working with both hands under difficult conditions. Safety, optimised information usage, and a better UX.

12 December 2012

And the Key Ingredient for a Successful Customer Experience Management is…

A recent Cisco survey  underlines the fact that the customer experience is of paramount importance online, and not just price. Overall 40% of respondents said they would be willing to spend more with a company if they improved the overall customer experience, In Western Europe the figure is 81% and 90% in the UK. But as is so often the case in these surveys, there is one monster missing factor: language equity.

Interestingly, a good third of the respondents (35%) want businesses to ensure they can easily ask questions and access information before making a purchase. This sort of behaviour is as old as the hills in the world’s traditional bricks & mortar shops and marketplaces, and is so natural a practice as to almost go unnoticed. But online this sort of exchange is complex to manage, though it can bring enormous added value to the retailer due to the rich usable data embedded in the query process.

Online, the natural first step is to trawl through the retail website, or check out the usually limited information available on product FAQs The next would be to talk (live) with the ecommerce site in question - or chat (live with a keyboard) if you sitting in a crowded train. The emerging platforms for live video chat using a  tablets will soon replace these. Or you always fall back on email. Another fast-trending way to get information indirectly is to consult or mine your social network and its long comment tails, though you’re more likely to get opinion than the full facts.

The eventual transaction will be sealed with an exchange of virtual money executed by a single click on an electronic button. But the prior information-gathering processing will depend entirely on the monster missing factor every centimetre of the way: a customer using their language to reach a goal.

Almost every exercise in customer experience management using today’s communications tools will need to put language management high on its list of must-haves. Way ahead of all the other bells and whistles that will win customer confidence, build loyalty and accelerate ‘conversion’. It’s all so obvious…yet so often forgotten.

Negotiating in a marketplace, a smart merchant will usually adapt their language to yours. Online, that means an end-to-end (not just a landing page) experience in a customer’s language – text, voice, videos - the lot. For the LT industry the issue is clear: it is not about how to introduce translation tech everywhere to see if its work in situ. It means starting from deep knowledge about the customer experience and inventing transparent language solutions that perform the right job, whatever you decide to call them.

11 December 2012

Bloomberg BNA Selects TEMIS for Content Categorization and Enrichment

Leading Legal Publisher Applying Powerful Luxid® Content Enrichment Solution to Vast Archive.

New York, NY– December 11, 2012 – TEMIS announced that it has signed a multi-year license and services agreement with Bloomberg BNA for use of the flagship TEMIS software solution, Luxid® Content Enrichment, to provide a key capability in the management of its multimillion-document database. Bloomberg BNA is a wholly owned subsidiary of Bloomberg and a leading source of legal, regulatory, and business information for professionals.

Bloomberg BNA has chosen to deploy the Luxid® semantic tagging and linking platform as a powerful method for categorizing news articles and other content, consistently indexing unstructured data against a comprehensive legal taxonomy.

“We are honoured that our flagship Luxid® Content Enrichment platform has been selected by an industry leader as highly regarded as Bloomberg BNA”, said Guillaume Mazieres, EVP North American Operations of TEMIS. “TEMIS will help Bloomberg BNA deliver accurate, targeted and always relevant information to its demanding professional customers, offering an unparalleled user experience.”

“Bloomberg BNA strongly believes that powerful content enrichment technology such as Luxid® from TEMIS will enhance subscribers' interactions with our content, improving the efficiency of search as well as opening up interesting possibilities for connections based on document similarity”, said Audrey Hipkins, Chief Product Officer at Bloomberg BNA.

About TEMIS

TEMIS helps organizations structure, manage and leverage their unstructured information assets. Its flagship platform, Luxid®, identifies and extracts targeted information to semantically enrich content with domain-specific metadata. Luxid® enables professional publishers to efficiently package and deliver relevant information to their audience, and helps enterprises to intelligently archive, manage, analyze, discover and share increasing volumes of information.

Founded in 2000, TEMIS operates in the United States, Canada, UK, France and Germany, and is represented worldwide through its network of certified partners.

TEMIS’ innovative solutions have attracted the business of leading organizations such as AAAS (American Association for the Advancement of Science), Agence France-Presse, BASF, Bayer Schering Pharma, BNA (Bureau of National Affairs), BNP Paribas, CARMA International, Editions Lefebvre-Sarrut, Elsevier, EMC, Europol, French Ministry of Defence, French Ministry of Finance, Gannett, Karger, Invest in France Agency, Merck Serono, Nature Publishing Group, Novartis, Philip Morris International, PSA Peugeot-Citroen, Sanofi-aventis, Simon & Schuster, Springer Science+Business Media, The McGraw-Hill Companies, Thieme, Thomson Reuters, Trinity Mirror plc and the U.S. Department of Agriculture.


Author: Martine Falhon
Corporate Communications
martine.falhon[a]temis.com
Tel.: +33 (0)4 56 38 24 03

07 December 2012

LTInnovate’s Weekly – Joining the News Dots – 1-7 December

Simplifying the Content Workflow
ABBYY’s new Mobile OCR app will help you digitise content in situ, with Acrolinx’ new 3.0 release you can SEO your content to make it optimally searchable , and Summly will summarise it all into 350 words for small form devices once it’s published.

Niche Applications
As always, it is the business case that matters most. Inbenta can help you manage email content for CRM units, while Elsevier is providing Meditech’s electronic health records with its ExitCare patient-centric content in English and Spanish. And if you haven’t yet realized that linguistic analysis (not just stats) is a must-have for effective text analysis of any content under the sun, then read Bitext’s white paper on the topic.

Making Multilingual Work
Premier MT supplier SYSTRAN has brought out SYSTRANLinks to simplify website localization . And to help you make MT quality evaluation less of a headache, TAUS has launched its Dynamic Quality Framework toolset to centralise and share best practices on this thorny subject. Two possible candidates are the European and Chinese Patent offices, who have just launched a joint English-Chinese MT system for IPO documents using well-known technology. Is bilingual the same as multilingual?

The Business Value of Audio Recordings
Technology can now help harness vast volumes of spoken content – accented, emotive, personal, in-your-face, revealing language data. This speaks volumes to different constituencies, including VOC analysts, security forces, healthcare personnel, educationalists, speech technologists, and more. Three examples this week: Wolters Kluwer and FX are to make audio recordings of healthcare sector meetings available; Natterbox now integrates with NICE systems to deliver centralized fixed/mobile audio recording for customer analytics and BI. And TextMaster has launched a first-of-its kind mobile app to transcribe/translate spoken recordings.

SYSTRAN Launches SYSTRANLinks Simplifies website localization

SYSTRAN, today announced, during the Web 2012 Paris, the release of SYSTRANLinks, an online service which makes website translation faster, easier and more cost-effective than current solutions. SYSTRANLinks is now live and available for use by small to large businesses, eCommerce sites, language service providers, Web agencies, and innovative individuals wanting to extend their global reach, by subscribing to one of four service levels: Free, Standard, Pro, and Enterprise. For more information or to test drive this exciting new service visit www.systranlinks.com.

Companies recognize the value of being able to talk to multinational audiences in their own languages. The challenge is to create geo-friendly Web content that can penetrate new markets, surpass customer expectations and enhance the value of the brand it represents.

However, rolling-out website translations can be an extremely difficult and expensive effort, involving a complex maze of diverse technologies and costly suppliers.

SYSTRANLinks empowers companies with a rich range of tools which expedite the website localization workflow and optimize quality in a cost-effective manner. Businesses can launch and manage website localization projects via SYSTRANLinks’ innovative and reliable collaborative online platform. In just a few clicks, SYSTRANLinks reproduces, pre-translates and hosts new websites. The user-friendly tools allow editors to continually enhance content, shortening the time to break language barriers so businesses can penetrate new global markets and develop opportunities.

The full press release is available for download here.

Grasshopper reduces the number of incoming e-mails by using Inbenta on their customer support portal

Inbenta, a provider of Natural Language Processing technologies and Semantic Search, member of the LT-Innovate Network, announces that Grasshopper was able to reduce the number of oncoming e-mails by using Inbenta Semantic Search on their Customer Support portal. Grasshopper customers are now able to find more of the answers they’re looking for without submitting a ticket to their support team.

Inbenta’s “Instant Email” functionality, available as a Zendesk Integration, dynamically retrieves relevant articles and FAQs while the users type their e-mails, without disturbing or altering their user experience. As users are provided with relevant information as they type an e-mail, they don’t have to send their email and wait for a response.

As Allison Canty, Social Media and Community Manager at Grasshopper, says:
"We saw results almost immediately after implementing Inbenta on our Zendesk powered support site. In the first two months alone, the percentage of unanswered questions went down, from 21.83% to 8.29% and our click-through ratio increased by 34.4%."

"In the first week after implementing Inbenta’s dynamic FAQs on our “submit a request” page, we measured a 22.91% deflection rate for Emails/Tickets,"

"With Inbenta’s help, we were able to identify topics we were lacking in content and create it, based on Inbenta’s suggestions, which helped us greatly improve our online Customer Service. Our customers are now getting more relevant answers to their questions, faster and without submitting a support ticket. It’s a win-win for everyone!"

06 December 2012

Language Technologies: Establishing Europe's Global Market Position & Securing the Digital Single Market

On 6 December 2012, top executives of European Language Technology companies, who have joined forces in LT-Innovate, launched a Discussion Document calling for a major effort to deploy language technologies as a key building block of the Digital Single Market and enabler of a language-neutral eContent economy. 

The Discussion Document underlines that “mastering human language is the next big opportunity in Information and Communication Technologies (ICT) and Language Technology (LT) is a key technology of the future. Europe (a multilingual continent with more than 63 languages) should step up its investment in LT as strategic key enabling technologies that will determine the continent's economic competitiveness as well as its cultural integrity".

In one its main proposals, the Document calls for a “European Language Cloud” (ELC) which would considerably bring down the costs of cross-border products and services and allow all Europeans to seamlessly interact with the 1 billion+ market of speakers of European languages. The ELC would constitute a major opportunity and boost for the European economy and society, creating many new jobs. Such a platform should also support the languages of Europe’s major trading partners, making European companies and citizens fittest for the global markets.

Jochen Hummel, LT-Innovate Chairman, comments: “There is no lack of innovation in Europe. The main stumbling block is that Europe’s SMEs do not grow beyond their national or regional linguistic islands to address European and global markets. European SMEs must get access to a market of continent-wide dimensions, through European-scale projects backed by European-scale funding. The European Commission’s Horizon 2020 Programme offers a major opportunity to boost innovation and regain competitiveness; but European Programmes need to become results-driven and should put innovators and job creators, in other words SMEs, into the driving seat!”.

Download the Discussion Document here... and comment on it below!

MORMED: a new platform for seamless, interactive, and multilingual collaboration in healthcare and life sciences

LTC is proud to announce the availability of a new social networking community platform in the medical domain. It features near real-time communication in multiple languages for wikis, blogs, forums, polls etc., together with semantic analysis, and a content recommender.

LTC successfully completed the final review of the MORMED project - Multilingual Organic Information Management in the Medical Domain - on 28th November, 2012. Fifty per cent funded by the European Commission, MORMED enables instant global knowledge sharing and interactive communication across language barriers. Its fast, automated and secure translation solution continuously improves by learning from human quality feedback. The system also provides enhanced content analysis for users by suggesting suitable tags for content creation and recommending relevant information.

Key features include an innovative translation workflow back-end and intelligent, language-neutral information processing capabilities. All functionality is available through a dynamic user interface. Users can easily understand the information exchanged within community groups regardless of the language used.

The benefits of this platform have been clearly demonstrated for the user communities of two conditions: Lupus and Antiphospholipid Syndrome. This was clearly brought out in comments from the reviewers, one of whom said that the “partners invested a lot of time and energy to make this project successful, and we commend the continuation of LTC’s substantial efforts in creating a superior product. We wish LTC and its partners every success in marketing MORMED to potential customers in the medical domain and beyond.”
Susan Fraser, EU Project Officer, added that “it is a satisfaction to see that an EU-funded project has produced results that are clearly beneficial to the end-users, as testified by the moderator of the live MORMED platform Angie Davidson, Campaign Director of the St. Thomas’ Lupus Trust, during the review meeting.”

LTC will now make the highly innovative MORMED solution available to enable specialised communities to dynamically share information and collaborate interactively in overcoming language barriers across a range of usage scenarios.
About LTC

LTC provides cutting edge and validated language technology and services that accelerate time to market, create new global revenue opportunities, expand worldwide brands and drastically reduce operational costs. LTC offers seasoned expertise in all aspects of authoring, managing and delivering corporate content in multiple languages including workflow, and all aspects of multilingual technology.

More information about LTC/Mormed, please contact: 

Bradley Harrad
Tel: +44 (0) 2085492359
Email: bradley.harrad@ltcinnovates.com 

04 December 2012

Lingle Online Appoints a Learning Industry Professional to the Board

LingleOnline Ltd. has announced the appointment of Eric Baber, IATEFL President and member of the European Learning Industry Group executive committee, to the Board of Directors. CEO Ian Butler described him as “a highly experienced educator and distinguised advocate of technology in language learning”. He further added “Eric brings a wealth of knowledge in teaching with technology together with an in--‐depth strategic understanding of educators’ needs” to Lingle.

01 December 2012

LT Innovate’s Weekly - Joining the News Dots – 24 Nov- 1 Dec

Translation: Speedy, Crowdsourced and Absolutely Vital
While Finnish-firm Multilizer released version 3 of its translation management solution and the Jordan newbie Dakwak launched a powerful new localisation platform presence for MENA regional start-ups , the rest of the industry seemed be focusing on the still uncertain virtues of crowdsourcing. Lionbridge acquired crowdsourcer Virtual Solutions while Adobe launched its own translation crowdsourcing community site  to explore best practices. Sign of the times, the EPO co-launched an English-Chinese engine with China’s patent organisation.

Virtual Assistants: Personal, Smart and Coming to a Screen Near You
The buzz around IVAs (intelligent Virtual Assistants) is growing. Intel Capital has taken a stake in Spanish IVA maker Indisys . Could this be related to Intel the company’s effort to prepare interface developers for its new perceptual computing kit? In another move, the French IVA firm Akio, has snapped up Dialonics, a dialog solutions compatriot All this probably explains why voice specialist research house Opus has published one of the first reports of its kind on ‘personal virtual assistants’.

Semantics & Content – a Natural Partnership
In the UK, Concept Searching signed semantic information management partnerships with mining specialist RKO in Western Canada, and IT services firm Black Blade while Brand Embassy acquired start-up Beepl semantic search supplier. TEMIS will now deliver its semantic enrichment technology to global publisher Datamatics, which is especially focused on the eBook marketplace. Deeper down in the infrastructure, Norwegian developer Webnodes AS released a Semantic Integration Server to bring more semantics into database technology 

Member States of Language
Sweden pushes its model for multilingualism , Wales is proud to open a Clinithink global &D centre , and Ireland is building a cloud computing research centre.

Exploit of the Week
The CreST creative speech tech network is driving a road-show around Northern England to boost awareness of …yes, speech technology!

30 November 2012

Language Interpretation in Search of the Right Technology

Recently Microsoft demo’ed its new speech to speech (S2S) translation system that delivers a target language version in the “voice” of the speaker. A remarkable breakthrough, but does it herald the arrival of automated interpreting, as some people would like to think?
Probably not. Conference/meeting/trial/interrogation/medical interpreting is a different animal from the scenarios usually put forward for spoken translation. Whereas an S2S app may help you solve a personal communication problem in a hotel, police station, or restaurant, it can’t yet substitute for full-scale professional interpreting.
As Mark Seligman of Spoken Translation demonstrated recently at a TAUS session on S2S translation, a key component in a professional S2S tech mix is a ‘back translation’ channel – a method for double checking that the meaning of a polysemic word or phrase (i.e. with multiple possible meanings) has been properly translated. Without this back-translation, conversations can go deeply wrong very quickly. Yet real-time back-channel quality control is obviously not feasible for conference interpretation situations.
This means there is no point today in strategizing a direct pathway from the full S2S MT of existing apps such as Jibbigo or SpeechTrans to the standard professional interpretation situation. But as a recent insightful post from a director of ZipDX (a technology provider to the profession) shows, interpreting should embrace rather than disparage the promise of automation enhancement. There are three specific areas where innovative technology could help: speech semantics, voice quality, and device agnosticism.

Speech tracking as a productivity tool
Rich interpretation-specific speech resources should be built up to allow smart developers to invent apps that can leverage intrinsic intelligence for interpretation performances. One example: a monitor could track in real time what is being talked about in the meeting/conference, search in an interpretation memory cloud for previous translations, and make them available as written on-screen terminology prompts to the interpreter. In other words, transposing recent ‘memory’ tools developed for written translation to spoken translation could provide language aids that go beyond the term lookup tools typically found in the interpretation booth.
There are naturally legal issues about recording meetings, but if we could develop a multi-language event streams records all language streams in parallel in the cloud, then anonymised resources could be built up to help interpreters stand on the shoulders of their colleagues, and also stimulate LT developers to innovate with new smarter systems.

Good sound quality is critical
In international meetings, interpreters usually work on the premises using the local audio system. This is particularly costly due to the price of presence. As an alternative, there are now many mature collaboration telecom solutions that could help more interpreters work remotely (possibly using telepresence) under much improved audio conditions – a vital condition for good interpreter performance over the telephone.
Recently the German research institution Fraunhofer ISS released its Full-HD Voice codec that supercharges the communication quality of any VoIP app, providing the kind of professional quality that interpreters will need whatever their communication channel. Microphone makers such as Philips are also reducing ambient noise to boost audio quality in recordings.

Videoconferencing and the device/channel revolution
Interpreters always like to see the speakers whose words they are translating, together with any visual media used in the meeting. As smartphones, tablets and even TV sets are now part of the media mix for content sharing, there are multiple possibilities for a BYOD (bring your own device) agenda for interpreters. This would preserve the rich content of any meeting while also enabling interpreters to join the collaboration revolution driven by unified communication..
In addition to mainstream videoconference and telepresence players, providers such as Skype, a Microsoft company, are now deeply integrated with the voice features of Windows 8, for example. They are offering cost-effective communication platforms that could easily provide the visual capabilities to add value to interpretation functionality. This would enable cash-strapped SMEs to join in the multilingual global conversation under good quality conditions.
Do you have any ideas about how to make interpretation a more integrated function of digital communications?

26 November 2012

LT-Innovate’s Weekly - Joining the News Dots - November 19-24


Competition in the Intelligent Virtual Assistant (IVA) Space. In China, Ifyly's Tek Yu Dian dominates the mobile phone market. In Europe, Spain’s Inbenta has launched an IVA for enterprises, and Speaktoit has notched up 5 million users. It’s hardly surprising that Taiwan device makers believe that speech interfaces will be the next big thing after touch screens.

Translating Market Data into New Solutions. Middle-eastern company Dawak claims that failure to multilingualise ecommerce sites is costing trillions in lost business, while CSA reckons the manufacturing sector is worth a juicy $11B to the translation industry (a third of the total). That’s why Atril has launched a new web translation management product TEAMServer, Transfluent is expanding its domain range, and Lingo24 has loaded a new MT solution.

How Digital is your Language? Manx (spoken in the Isle of Man) gets a revival app, Portuguese is the fastest growing language on Facebook, and new multilingual social network Hamu will be trying to get their respective speakers to chat together.

Smarter Health Tech. AMI has delivered some clever coding to simplifying medical data recording; Nuance says radiologists see ASR as a must-have, Wales is strategising for Welsh to become a language of healthcare, smartphones and social media are becoming the new home medical encyclopaedia, and the UK government wants Britain to lead in health technology.

Kings of Content. Pearson is to leverage its educational publishing assets into smarter online content for schools, while Wolters Kluwer is expanding to Asia, expanding its Chinese workforce. Wherever you are you can benefit from a Springer API for more intelligent approaches to scientific; technical and medical content.

Customers of the Voice. Fraunhofer is improving VoIP voice quality, which should further enhance Skype’s new Android update as it pushes deeper into mobile voice and video services. Better channel quality would also add value to Belgian firm Acapela as it launches a new range of expressive TTS voices for content.

21 November 2012

TechScribe Releases OS Term Checker for Industry Controlled Language Standard



The UK technical writing aid provider has released an open-source term checker solution for ASD-STE100 issue 3 on its site. The ASD-STE100 standard is an international specification for preparing content for maintenance documentation in a controlled language. Until now software needed to help technical writers conform to ASD-STE100 has been very expensive. This TechScribe solution democratizes the technology.

19 November 2012

Multilingualism needs to be business-driven, and "no size fits all", in particular for SMEs


The Final Workshop of CELAN – Language strategies for competitiveness and employability took place on 15 November 2012 in Brussels and was attended by key stakeholders such as business associations, Higher Education representatives, decision-makers but also companies that implemented language strategies successfully and language technology providers. 

"Multilingualism is key for business, growth and the Europe2020 strategy" emphasised Sonia Peressini, DG EAC, Multilingualism Unit, in her welcome address. And Wolfgang Mackiewicz, Freie Universität Berlin and CELAN coordinator evoked again the two main tenets CELAN came to: "multilingualism needs to be business-driven, and "no size fits all", in particular for SMEs". 

Workshop presentations:

The beta version of a needs assessment tool is available now. Any feed-back is highly appreciated to letter-box@emfs.eu.

16 November 2012

taraXŰ: The first end-to-end machine translation environment project status as of October 2012


The rising demand for machine translation (MT) and the resulting implementations have shown that workflows and system architectures can become very complex. Particularly, the setup of combination or hybrid systems making use of different MT engines running in parallel has kept many language service providers from introducing MT technology.

To address this issue, euroscript teamed up in 2010 with a strong project consortium in the highly ambitious R&D project taraXŰ. The project has by now resulted in the first advanced machine translation environment that deploys amongst other things:
  • Corpus and translation management (running different rule or template based and statistical machine translation engines in parallel)
  • Automated source- and target-language quality assessment
  • Automatic selection of the best translation through innovative methods
  • A web-interface allowing human translators to correct machine translation output and assess its quality by ranking, post-editing, error-analysis, etc.

"The taraXŰ project gives us an unprecedented opportunity to perform applied research in a human-centric setting. The early inclusion of human translators in the development process is definitely a winning strategy to further improve machine translation quality" says Hans Uszkoreit, Scientific Director at the German Research Center for Artificial Intelligence (DFKI) and Head of DFKI Language Technology Lab.

The taraXŰ project consortium relies upon expertise in translation services, machine translation and evaluation, language checking, language technology, and related fields. Project partners are Acrolinx (member of the LT-Innovate Network), the German Research Center for Artificial Intelligence (DFKI), euroscript Deutschland and yocoy (member of the LT-Innovate Network and winner of LT-Innovate Award 2012).

SoDash has won a Real Business / Wonga "Future 50" award for it's NLP-powered social media analysis.


Guest post from Winterwell Associates, member of the LT-Innovate Network.

Firms need to interact with social media such as Facebook and Twitter. But it’s a hell of a chore. Much of the chatter on these networks is, well, just that.

So how to winnow the wheat from the chaff?

Two Scottish PhDs have developed an algorithm-powered Artificial Intelligence social-media dashboard that offers some truly extraordinary tools for firms seeking to improve their performance.

They explain it thus:

The SoDash AI can be trained to distinguish between what’s relevant to your business or campaign, and what’s just pointless chatter. While other tools determine sentiment solely based on generic positive or negative words, SoDash is trained to decipher elements such as a message's structure, content, and who’s sending it. For example, ‘Irn Bru is a sick drink!’ means something very different to ‘Irn Bru tastes like sick!’. SoDash puts all this into context to build up a deep understanding of why certain messages mean certain things to each client.

SoDash clients define their own categories and enjoy powerful automation and analysis tailored to their specific definitions. The AI automatically tags messages with bespoke labels, delivers market information, ghost-writes responses, provides exceptionally accurate reports and much more. With SoDash, clients can quickly make sense of the countless conversations flowing across social media platforms and turn them into valuable, performance-defining research.

Virgin, Universal and Phones4U are all using it. SoDash revenues are growing fast. Well worth following.


Author: Dr. Daniel Winterstein, Winterwell Associates.

13 November 2012

Open Call (+€7K!) for Collaboration from Fusepool


The Geneva-based Fusepool initiative is offering up to €7.000 to any SME or similar interested in co-creating and testing the flagship applications developed in one of Europe’s leading data-pool platforms!

If you are an SME - or working closely with SMEs - and interested in data tools for patents, tenders, partner matching or customer feedback, then the Fusepool team is interested in your help to deliver usable and effective applications as close as possible to your user needs. Find all the information and details here.

Fusepool is dedicated to refining and enriching raw data using common standards and provides tools for analyzing and visualizing data so that end users and other software receive timely, context-aware and relevant information whenever they need it and wherever they are. To ensure high quality data and results, Fusepool combines well-defined but error-prone (semantic) Web 3.0 with controlled supervision and the collaborative but often messy (social) Web 2.0.

12 November 2012

Is it Worth the Investment to Develop in Your Native Language if it isn’t English?


Guest post from Kwaga, member of the LT-Innovate Network.

Creating your own online start-up is a fascinating, challenging and certainly risky path. While there are a number of “invigorating challenges” along the way, there is one that is a particular thorn in the foot of many of non-native English speakers.

Launching your venture in English AND your native language.

Our Paris start-up, Kwaga, felt compelled to develop our applications for the larger international market in English, but has also done so in French, the language of the majority of our team. It makes sense, right?  Yes, but does that mean it’s worth doubling the workload at every step?

It depends.

Our expertise at Kwaga is in natural language processing and after a few initial “pivots” we developed something that’s really caught on; our flagship product, WriteThat.name, analyses email signatures and updates our clients’ address books automatically. The complex processing chains we developed must cover at least two languages, as experience has taught us that multilingualism cannot be improvised: it is the overall architecture of a processing chain that is multilingual or not and recoding a monolingual prototype is not the sort of nightmare our developers would want to relive! 

Packaging the product or service in multiple languages can be very time-consuming.

Designing and implementing multi-lingual user interfaces takes twice as much time and can be costly, so here are a few things to keep in mind:
  1. Do you outsource the English?  Translating documentation isn’t that expensive, but the cost of continually changing your website can be significant, make you less agile and of course take more time.
  2. Translating unique terminology and product names can be really tough and certainly takes a few back-and-forths before settling on what feels “just right. So outsourcing this can become quite expensive and time-consuming.
  3. Testing new interfaces, correcting bugs etc. are of course twice as time-consuming
  4. At some point, we should mention that coding in multiple languages means coding accents, yet not all APIs and development environments are even set up for this.  This means finding “hacks” or workarounds that are – again - time-consuming and can result in bugs as products evolve.
  5. Two different texts might not fit into the same image on a website, not to mention the issue of encoding and fonts: most European languages are much richer than English in diacritics and can be another headache (as mentioned in point 4).
  6. It’s always good to do some A/B testing to really get your site and communications fine-tuned which of course means not writing twice as much... but four times (and that’s if you’re only working with 2 languages)!    
  7. Do you develop two separate social media streams as well?  If you choose to do only one, yet reply in many, some clients can become annoyed at reading a foreign language.
Our Solution and Current Tough Decision
Once we had a good product fit and enough traction, we decided to hire Native English speakers for customer support, communications and marketing as the international English-speaking market quickly became our dominant user base.

At this point, we sometimes wonder if continuing to market in French is a good business decision.
Should we hang onto bilingualism at the expense perhaps of our agility to develop new products? Our ability to process messages in many languages is certainly a competitive advantage in a global market, but maybe the interfaces and communications in French are a luxury that a start-up cannot afford?

We wonder how these kinds of decisions are impacting your language technology or start-up ventures and would love to hear how you’ve approached the subject.

Author: Gaëlle Recourcé, Chief Scientist at Kwaga.
(PS This was translated from French by Brad Patterson, hence it took twice as long as well ;-) 

08 November 2012

Eptica has acquired multilingual semantic search engine and “sentiment analysis” software developer Lingway.


Eptica, a leading provider of Multi channel Customer Interaction software, today announced the acquisition of multilingual semantic search engine and sentiment analysis software developer Lingway .

Lingway's advanced technology will strengthen Eptica's multichannel customer service suite, enabling organisations to improve the customer experience and benefit from increased consumer insight.

Eptica enables organisations to create the best customer experience by delivering the answers customers want on the channel of their choice (phone, email, web, chat, social media, and mobile). Through its powerful technology, Eptica's software ensures every request is handled efficiently whether managed through a self-service channel or the contact centre.As Eptica records the complete multichannel history of every customer contact, whether through Facebook, email, chat or Tweets , companies benefit from improved customer management, reporting and insight.

Videos of the TAUS User Conference 2012 on Translation Automation Now Showing


The full videos of contributions to the recent TAUS User Conference are now available online They include two panel sessions on how (large IT) translation buyers and translation service vendors see the future; contributors including Translated.net and Moravia. There is a clear sense that consolidation is likely in the service sector, and that continuous content streaming, crowdsourcing, and machine translation will now be the norm on the buyer side. The shift to a new, more agile market for translation services may well favour new, innovative players.

You can also watch a session on speech-to-speech translation (with demos), and a host of rapid-fire technology showcases proposing updates or new ideas in translation automation from small or large translation tech firms from around the world. TAUS conferences play a major role in developing a sharing and collaborative mindset among translation players from all over the world, especially Europe. Not just sharing ideas but also in building a practical platform for sharing resources.

It is worth remembering that LT-Innovate has found that two thirds of the top 100 vendors in the “globalisation industry” are based in Europe (half of the top 10), and that there are several thousand companies offering Translation Technology services of various sorts, many of them micro-enterprises but including a significant number with revenues over €50M. Their future may well depend in part on what we pick up from open discussions at TAUS events.

Author: Andrew Joscelyne

The BBC: from Auntie to Lady Semantica

The BBC is affectionately known in the UK as “Auntie”, probably for its gentle and slightly old-fashioned didactic style. But deep in its IT ecosystem, the huge broadcaster is a hot bed of innovation. Not for nothing is it ranked only second to Google as the “favorite place to work” for LinkedIn techie job seekers.

For anyone interested in how a major content publisher is embracing the challenge of language technologies, check out this long interview with BBC ‘semantic web’ people. After full-scale coverage of the World Football Cup last year and the London Olympics in 2012, the content team have been exploring all the implications of delivering tailored archival content for a cutting-edge online user experience. Or what LT-Innovate is calling “intelligent content”. 

Below is a summary of what they’re thinking about today:
We are currently exploring various other uses of Semantic Web technologies within BBC R&D. In particular we’re looking at ways in which Linked Data can be used to help search and discovery of archive content. We have been working on automatically identifying the topics and the contributors for BBC programmes from their content, using a combination of Linked Data, signal processing, speech-to-text and Named Entity Recognition technologies, which we have been talking about in various places, such as the Linked Data on the Web workshop and at WWW’2012. The automatically generated links from programmes to entities described in the Linked Data cloud might be incorrect in places, so we are also exploring how users can validate or correct those links, and how this feedback can be taken into account within our automated interlinking workflow. We are planning to write in more details about our experiments in that space on the our blog in the next couple of weeks.

Check out their blog to keep abreast of Auntie’s rapid reinvention as Lady Semantica. 

Author: Andrew Joscelyne

05 November 2012

The LTi News Roundup - 5th November 2012 (part 1)


Weekly news round-up prepared by the Editorial Staff of LangTechNews for LT-Innovate, the Forum for Europe’s Language Technology Industry.

LT-Innovate: October European News Round-up

Introducing a new LT-Innovate service - a monthly update up of must-have news about events impacting the European LT industry from our dedicated site.
October traditionally marks the start of Q4 for businesses, a massive global conference season, product launches (in the consumer run-up to Christmas), and much strategizing and predicting about the coming year. Which means plenty of news flowing around the LT sector.

Big Content Publishing

Pearson and Bertelsmann have agreed to merge Penguin and Random House book publishers to build the world's largest consumer (popular and educational) publishing house with an eye on the still-emerging e-book or e-reader market. For the LT sector, a global player will mean large language resources for testing and expanding language technologies. E-books will draw on voice synthesis, and literary and educational content will need translating quickly when best-sellers hit the global markets. Digital convergence of text, video, and sound technologies should also drive development in innovative new 'cultural' products. 
Meanwhile technical publishers are further honing the usability of their wares. Elsevier has announced that chapters will now be a natural unit of online information for the technical publishing market, and will add chapter-specific metadata to help users search for the precise content they need. And Wolters Kluwer has acquired the company Health Language to boost the searchability of its point-of-care content by incorporating high quality medical terminology into its search technology. 
Film buffs and subtitle technology suppliers should note that the British Film Institute is planning to digitise 10,000 films over the next five years, possibly providing a vast store of spoken language data and a very long tail of speech translation opportunities.
A handy content generation aid (aka authoring) comes from the Norwegian firm iFinger which offers word/term search on or offline from leading content providers (Collins, Ernst Klett Verlag and Cappelen Forlag).and Wikipedia in all languages.

The LTi News Roundup - 5th November 2012 (part 2)

Weekly news round-up prepared by the Editorial Staff of LangTechNews for LT-Innovate, the Forum for Europe’s Language Technology Industry.


Making Translation Simpler

Lots of conferences and networking in the translation industry this month with the TAUS User Conference, tcworld and LocWorld events among others. One small trend: the emergence of what might be called “translation analytics” – i.e. business intelligence about the people and processes involved in translation. The Luxembourg provider Wordbee released a new business analytics module for their Enterprise Translation Management System, and discussions among TAUS service vendors focussed partly on the need for more data on the translation process so as to optimise anything from technology to translator selection. This is only natural as analytics of all kinds become part of the Zeitgeist.
When it comes to operations, the trend is to simplify workflows to reduce process time in the production chain. TranslateKarate for example delivered a super-simple online workflow, TAUS has launched a stripped-down API to streamline translation content exchange after inheriting the mantle of standards watchdog earlier last year. Meanwhile the Irish start-up KantanMT launched a BetaIV version of its cloud engine as part of its continuous development agenda, in a bid to attract more customers before the paying service kicks in. And Kilgray received a Deloitte Award as one of the leading young tech companies in Hungary, rewarding its insistence on building a user ecosystem bottom-up together with its customers.

Language Learning

In a global language learning market worth $58.2B in 2011 (including individual and enterprise services), it is surprising how little serious innovation news swims into our focus. So it was good to hear that Irish start-up RendezVu received an EC seal of approval for its ExamSpeak application to help learners prepare for exams. 
In another move, the online language learning busuu with over 25M users in 200 countries has won another round of funding and is moving to London to stay closer to their VC partners and HarperCollins publisher – and perhaps benefit from the positive tech-biz vibrations in the UK. 
AABBY has meanwhile released a set of Lingvo Dictionaries for iOS which offer learning friendly information and pronunciation aids for smart phone users.
In a big data world, it’s also worth noting that the Swiss firm Education First has published its latest report that rates countries on their proficiency in speaking English. Although it is obvious why English is the language of choice for this exercise, it would be interesting to have data about proficiency in other languages. And to fine-tune the analysis below the level of countries (too many are almost neck and neck) to other useful demographics.

The LTi News Roundup - 5th November 2012 (part 3)


Weekly news round-up prepared by the Editorial Staff of LangTechNews for LT-Innovate, the Forum for Europe’s Language Technology Industry.

Text Analytics 

In the busy world of sentiment detection and text analytics, there seems to be a growing need to treat languages in the plural. Witness the choice of the Spanish company Bitext to join the Salesforce Marketing Cloud Social Insights Ecosystem. This will offer it access to more customers and enable it to promote its Spanish language analytics solutions more globally. Interestingly, this ecosystem has attracted the US firm LinguaSys which also specializes in multilingual analytics and strongly believes that you cannot do proper text analytics via translation. European social media analysis specialists would seem to be well-placed to provide the added value of local language insight.

Pressure on Unified Communications?

A report from the UK found that open standards are needed in the videoconferencing market to simplify the complex puzzle of video hardware and services. Videoconferencing in a time of shrinking travel budgets looks like a no brainer, and the arrival of new interfaces – tablets and smartphones – is putting pressure on suppliers to democratise this still-expensive meeting facility. At the same time VoIP companies like Skype with its Windows 8 integration and the German company FriendCaller are offering user-friendly alternatives to full-blown telepresence. In due course they will benefit from recording, summarisation and speech search and translation technologies that can help transform calls and online meetings into actionable rich content resources. Especially where risk, compliance and confidentiality play a key role

29 October 2012

DG CONNECT's Stakeholder Survey: "Help us do better"


DG CONNECT's Stakeholder Survey has been launched and remains open until 2 December. The aim of this survey is to learn more about individuals, groups, organisations and entities who are involved in, or have views on, shaping Digital Agenda for Europe. It seeks more information on the areas stakeholders are interested in as well as rating the importance and level of satisfaction of certain aspects of interaction, for example easy access to information and the importance of timely and accurate feedback.

Therefore, DG CONNECT invites you to take part in this Survey to help us do our work and serve you better. The objectives are to:
- get a better understanding of who DG CONNECT's stakeholders are
- get an overview of how DG CONNECT is perceived
- identify areas of improvement
- establish a baseline scenario to measure progress
- Your answers will feed into a strategy, which aims at further developing DG CONNECT's relations with different types of stakeholders.

The survey will take from 5 to 10 minutes to complete and answers will remain anonymous.
In January 2013 the main findings of the survey will be published on the Digital Agenda for Europe website.

#daestakeholder

25 October 2012

Technology and the great ‘refreshment’ of learning and playing


There is widespread fear that technology kills more jobs than it creates. Obviously so in manufacturing where Europe has seen hundreds of thousands of jobs ‘relocate’ to emerging economies. Or increasingly become replaced by industrial robots. Now, innovation in knowledge technologies – e.g. the language technology industry– will also appear to be replacing what we thought of as unique human skills by increasingly cheap techno-fixes. 

Copy-editors are being (so far inadequately) replaced by digital spell and grammar checkers. Court interpreters might eventually find themselves waiting longer for a phone call when the judge can simply plug into a (still perfectible) speech to speech translation service. Instantaneous speech analytics can do a quicker job than quality inspectors in pinpointing anomalies in contact centre practices. And trading translators for machines has been a familiar complaint for years now.

In knowledge-intensive industries, there will always be a certain subset of procedures that can be modelled as an algorithm and automated inside a workflow – think of AI-driven medical diagnoses or running text analytics on a large corpus of customer complaints. But we are also learning that technology can aid the agile human brain to rediscover a certain pleasure in activities that were once thought to have been ‘solved’ by digital methods. What were once tedious jobs, difficult mental games, or hard-grind obligations such as rote-learning can now – thanks to computers - be transformed into pleasurable recreations.

Interestingly, the English word recreation comes from the Latin recreationem meaning "recovery from illness" which evolved by 1400 into "refreshing oneself by some amusement". Harvard economics professor Kenneth Rogoff has recently drawn attention to a curious twist in the recent history of chess-playing. It has largely been ‘refreshed’ or ‘re-created ‘under the effect of a technology that might well have caused its extinction:

Back-of-the-envelope calculations suggest that, worldwide, technological change could easily lead to the loss of 5-10 million jobs each year. Fortunately, until now, market economies have proved stunningly flexible in absorbing the impact of these changes. A peculiar but perhaps instructive example comes from the world of professional chess. (…) In 1997, the IBM computer Deep Blue defeated world chess champion Gary Kasparov in a short match. Soon, potential chess sponsors began to balk at paying millions of dollars to host championship matches between humans. (…) Nevertheless, a curious thing has happened: far more people make a living as professional chess players today than ever before. Thanks partly to the availability of computer programs and online matches, there has been a mini-boom in chess interest among young people in many countries. Many parents see chess as an attractive alternative to mindless video games. A few countries, such as Armenia and Moldova, have actually legislated the teaching of chess in schools.

Another item on the ‘technology is skill-destructive’ agenda is the signature human ability to use memory to carry out a complex skill such as learning a language. There are regular complaints that digital tech has replaced (memory-based) mathematical skills, for example, or that a digital speech-to-speech translator will eventually eliminate the need for language learning. 

What may really be happening, though, is that technology is releasing us from the cognitive burdens we associate specifically with work. And enabling us to ‘re-create’ these erstwhile functional skills such as memorising and language learning as recreational pleasures.

Ed Cooke, the CEO of Memrise, is trying to encourage communities to invent new ways to learn old human tricks such as memorising. In an interview, he said:

My cultural prediction is that the notion of learning is going to become increasingly detached from what is practical and increasingly linked to what is recreational and interesting. All we really try to do on the Internet is learn stuff, to understand what’s going on. Reading the Internet is an incredibly inefficient way of doing that – you can read Wikipedia for hours and end up with one anecdote. I think there will be some really interesting technology coming in the next few years to combine learning, reading and recreation. As long as we continue to think of learning as a functional thing then we’ll soon have to confront very soon the fact that we’re redundant as a species.  But if you think of learning in the way you think about having a conversation or going to see a film as a personal way to have fun and enrich yourself, then that’s a better to think about learning in the long term.

If it proves true, this shift from a functional to a recreational – some might say a ‘gaming’ - mind-set in the digital age could largely draw on language technology. Language learning, language making (the recreational construction of Klingon-type languages for games of all sorts or for hobbyist communities) and generally playing around with anything from writing systems to spoken dialects could form just one strand in the great refreshment of our personal and social lives through technology.

Author: Andrew Joscelyne