<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:admin="http://webns.net/mvcb/">
  <channel rdf:about="http://blog.gmane.org/gmane.comp.apache.uima.general">
    <title>gmane.comp.apache.uima.general</title>
    <link>http://blog.gmane.org/gmane.comp.apache.uima.general</link>
    <description/>
    <syn:updatePeriod>hourly</syn:updatePeriod>
    <syn:updateFrequency>1</syn:updateFrequency>
    <syn:updateBase>1901-01-01T00:00+00:00</syn:updateBase>
    <items>
      <rdf:Seq>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2703"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2702"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2700"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2696"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2687"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2684"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2682"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2666"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2665"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2658"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2657"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2653"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2651"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2650"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2649"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2647"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2641"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2639"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2635"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/2634"/>
      </rdf:Seq>
    </items>
    <image rdf:resource="http://gmane.org/img/gmane-25t.png"/>
    <textinput rdf:resource=""/>
  </channel>
  <image rdf:about="http://gmane.org/img/gmane-25t.png">
    <title>Gmane</title>
    <url>http://gmane.org/img/gmane-25t.png</url>
    <link>http://gmane.org</link>
  </image>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2703">
    <title>2nd CFP New Challenges for NLP Frameworks, a workshop at LREC 2010</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2703</link>
    <description>&lt;pre&gt;======================================================================

    2nd Call for Papers

    New Challenges for NLP Frameworks, a workshop at LREC 2010

    22 May 2010, Valletta, Malta

    http://nlpframeworks2010.semanticsoftware.info

======================================================================

Natural language processing frameworks like GATE and UIMA have
significantly changed the way NLP applications are designed,
developed, and deployed. Features such as component-based design,
test-driven development, and resource meta-descriptions now routinely
provide higher robustness, better reusability, faster deployment, and
improved scalability. They have become the staple of both NLP
research and industrial application, fostering a new generation of
NLP users and developers.

Nevertheless, after more than a decade of the current generation of
NLP frameworks, the NLP research and application landscape is
shifting. This brings new challenges to both the developers of NLP
frameworks and their users.  Driving forces include in particular:

- Social Media

- Interoperability needs between different NLP frameworks,
  components, and resources

- Terabyte-Scale Data Sets

- Cloud and Grid Computing

- Semantic Computing, Ontologies, and Reasoning

- Cross-Media Language Analysis (text, speech, images, video)

- Ambient Computing

- Addressing more complex genres of language


THEMES AND TOPICS

This workshop will provide a venue for reporting ongoing work in the
context of NLP frameworks, such as UIMA, GATE, and other related systems.
Principal themes include:

- issues and approaches in processing of very large data collections, e.g.,
  parallelisation and distribution (particularly in relation to cloud
  computing)

- sophisticated tools to build and manage complex processing pipelines
  and to analyse results

- software engineering in relation to language computation

- solutions to interoperability issues combining components from different
  sources (e.g., GATE, UIMA, NLTK, OpenNLP, NooJ)

- integration with related areas (data mining, semantic
  repositories, big table databases)

- persisting experimental contexts (computation and data), e.g. via
  virtualisation

- distribution of self-developed components, repositories of ready-to-use
  UIMA/GATE-based components

- efficient embedding of NLP processing in diverse environments (including
  small memory devices)

- research on genericity of components and type-system independence

- Service-Oriented Architectures (SOAs) and Software-as-a-Service
  (SaaS) models of language computation

- automatic feedback processes of knowledge discovery and reuse from text


INTENDED AUDIENCE

The workshop aims to bring together developers and users of NLP frameworks
from different perspectives, in order to elicit new requirements, feature
successful solutions, and exchange successful patterns of NLP engineering.
In particular, perspectives from the following user groups are welcome:

- Application Developers, from both research and industry,
  with application experience reports

- Framework Developers, with an NLP/software engineering background

- Researcher users of NLP architectures


SUBMISSION FORMATS

We solicit the following types of publications:

Full research papers, describing novel, mature work, with an appropriate
level of evaluation. Maximum of 8 pages in LREC format.

Short research papers, describing novel, early work, with preliminary
results; as well as position papers or application experience reports.
Length of exactly 4 pages in LREC format.

Open source tool/resource papers, between 4 and 8 pages in LREC format. To
qualify for this category, the code or data must be accessible to the
reviewers and, if accepted, published together with the workshop under an
OSI-approved open source or open content license.

Note that the PC may suggest reassignment of a paper into a different
category depending on its contribution.

Your submission must be formatted according to LREC's authoring guidelines,
see http://www.lrec-conf.org/lrec2010/?Author-s-Kit-and-Templates

Submissions will be handled through the START system. When submitting a
paper from the START page, authors will be asked to provide essential
information about resources (in a broad sense, i.e. also technologies,
standards, evaluation kits, etc.) that have been used for the work described
in the paper or are a new result of your research.  For further information
on this new initiative, please refer to
http://www.lrec-conf.org/lrec2010/?LREC2010-Map-of-Language-Resources.


IMPORTANT DATES

February 12, 2010 - Deadline for workshop papers
March 8, 2010 - Notification of acceptance
March 18, 2010 - Camera-ready papers due
May 22, 2010 - Workshop in Malta


ORGANISERS

Rene Witte, Concordia University, Montréal
Hamish Cunningham, University of Sheffield
Jon Patrick, University of Sydney
Elena Beisswanger, University of Jena
Ekaterina Buyko, University of Jena
Udo Hahn, University of Jena
Karin Verspoor, University of Colorado Denver
Anni R. Coden, IBM T.J. Watson Research Center


PROGRAM COMMITTEE

Aaron Kaplan (Xerox, France)
Adam Funk (Uni. Sheffield)
Angus Roberts (Uni. Sheffield)
Anni R. Coden (IBM T.J. Watson Research Center)
Claude Roux (Xerox Research Labs)
Diana Inkpen (Uni Ottawa)
Diana Maynard (Uni. Sheffield)
Dietmar Rösner (Uni. Magdeburg)
Dragan Gasevic (Uni. Athabasca)
Ekaterina Buyko (Uni. Jena)
Elena Beisswanger (Uni. Jena)
Epaminondas Kapetanios (Uni Westminster)
Eric W. Brown (IBM T.J. Watson Research Center)
Graham Wilcock (Uni. Helsinki)
Guergana K. Savova (Mayo Clinic)
Hamish Cunningham (Uni. Sheffield)
Horacio Saggion (Uni. Sheffield)
Iryna Gurevych (Uni. Darmstadt)
Jian Su (I2R, Singapore)
Jochen Leidner (Thomson Reuters)
Jon Patrick (Uni. Sydney)
Juergen Rilling (Concordia Uni, Montréal)
Kalina Bontcheva (Uni. Sheffield)
Karin Verspoor (Uni. Colorado)
Katrin Tomanek (Uni. Jena)
Kevin B. Cohen (MITRE)
Leila Kosseim (Concordia Uni., Montréal)
Leo Ferres (Uni. of Concepcion)
Marc Light (Thomson Corp. R&amp;amp;D)
Michael Tanenblatt (IBM T.J. Watson Research Center)
Nancy Ide (Vassar College)
Nicolas Hernandez (Uni. Nantes)
Philip V. Ogren (Uni. Colorado)
Ralf Krestel (L3S Research Center, Hannover)
Rene Witte (Concordia Uni., Montréal)
Richard Eckart de Castilho (Uni. Darmstadt)
Sameer Pradhan (BBN)
Stefan Geißler (TEMIS GmbH)
Steven Bethard (Stanford Uni.)
Thilo Götz (IBM Germany)
Udo Hahn (Uni. Jena)
Valentin Tablan (Uni. Sheffield)
Yoshinobu Kano (Uni. Tokyo)
Yuntao Zhang (Shanghai Jiaotong Uni.)

&lt;/pre&gt;</description>
    <dc:creator>Elena Beisswanger</dc:creator>
    <dc:date>2010-02-05T20:51:44</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2702">
    <title>CPM still running after process</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2702</link>
    <description>&lt;pre&gt;Hello,

Well, the problem is in the title.

After mCPM.process(), the CPM still seems to be running.

How can I resolve this issue?

Regards

Rad
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-02-05T12:23:35</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2700">
    <title>telling the CPM to consider some changes</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2700</link>
    <description>&lt;pre&gt;Hello,

I'm back with a problem I have when running a CPM programmatically.

This is what I did:

I start from the default UIMA FileSystemCollectionReader descriptor, which I
load using

ResourceSpecifier colReaderSpecifier =
    UIMAFramework.getXMLParser().parseCollectionReaderDescription(
        new XMLInputSource("desc/FileSystemCollectionReader.xml"));
CollectionReader collectionReader =
    UIMAFramework.produceCollectionReader(colReaderSpecifier);

Then I developed 4 analysis engines that I add like this:

((BaseCPMImpl) mCPM).addCasProcessor(ae1);
((BaseCPMImpl) mCPM).addCasProcessor(ae2);
((BaseCPMImpl) mCPM).addCasProcessor(ae3);
((BaseCPMImpl) mCPM).addCasProcessor(ae4);


The problem is that I want to give the user the option to tell the
collection reader which folder to use for the documents to be
analyzed, so I used this code after producing the collection reader:

ConfigurationParameterSettings settings =
    collectionReader.getMetaData().getConfigurationParameterSettings();

org.apache.uima.resource.metadata.NameValuePair[] valuePairs =
    settings.getParameterSettings();

for (org.apache.uima.resource.metadata.NameValuePair nvp : valuePairs) {
    // TODO: customize settings and save changes back using this crappy CPE API
    if (nvp.getName().matches("InputDirectory")) {
        nvp.setValue("/path/to/test/if/that/work");
    }
}

Unfortunately, the CPM seems to ignore this change, and I think I'm
missing something here.

How do I tell the CPM that I've changed the collection reader's configuration
parameter settings?

Thx
Rad
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-02-05T10:53:35</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2696">
    <title>CPM class usage</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2696</link>
    <description>&lt;pre&gt;Hello,

I have developed a series of AEs that I tested through the CPE GUI, and now I'm
trying to write a Java application that uses these AEs to do the same
thing as the CPE GUI. So I took the SimpleRunCPM example as a reference and
made these changes:

 // create a new Collection Processing Manager
    mCPM = UIMAFramework.newCollectionProcessingManager();

    // Register AE and CAS Consumer with the CPM
    mCPM.setAnalysisEngine(ae1);
    mCPM.setAnalysisEngine(ae2);
    mCPM.setAnalysisEngine(ae3);
    mCPM.setAnalysisEngine(ae4);

Notice that here I have 4 AEs and not only one.

When I run this, I get an error at mCPM.setAnalysisEngine(ae2)
saying:

Initializing AnalysisEngines
Exception in thread "main" java.lang.IndexOutOfBoundsException: Index: 0, Size: 0
    at java.util.LinkedList.entry(LinkedList.java:365)
    at java.util.LinkedList.remove(LinkedList.java:357)
    at org.apache.uima.collection.impl.cpm.engine.CPMEngine.removeCasProcessor(CPMEngine.java:1188)
    at org.apache.uima.collection.impl.cpm.BaseCPMImpl.removeCasProcessor(BaseCPMImpl.java:361)
    at org.apache.uima.collection.impl.cpm.CPMImpl.setAnalysisEngine(CPMImpl.java:70)

So I don't know what the real problem is: whether the CPM accepts only one AE,
or whether the problem is in ae2, which I have already tested without any errors.

Any ideas?

Thx

Rad
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-02-03T10:40:08</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2687">
    <title>LREC Workshop "New Challenges for NLP Frameworks" - 2nd Call for Papers</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2687</link>
    <description>&lt;pre&gt;======================================================================

   2nd Call for Papers

   New Challenges for NLP Frameworks, a workshop at LREC 2010

   22 May 2010, Valletta, Malta

   http://nlpframeworks2010.semanticsoftware.info

======================================================================

Natural language processing frameworks like GATE and UIMA have
significantly changed the way NLP applications are designed,
developed, and deployed. Features such as component-based design,
test-driven development, and resource meta-descriptions now routinely
provide higher robustness, better reusability, faster deployment, and
improved scalability. They have become the staple of both NLP
research and industrial application, fostering a new generation of
NLP users and developers.

Nevertheless, after more than a decade of the current generation of
NLP frameworks, the NLP research and application landscape is
shifting. This brings new challenges to both the developers of NLP
frameworks and their users.  Driving forces include in particular:

- Social Media

- Interoperability needs between different NLP frameworks,
 components, and resources

- Terabyte-Scale Data Sets

- Cloud and Grid Computing

- Semantic Computing, Ontologies, and Reasoning

- Cross-Media Language Analysis (text, speech, images, video)

- Ambient Computing

- Addressing more complex genres of language


THEMES AND TOPICS

This workshop will provide a venue for reporting ongoing work in the
context of NLP frameworks, such as UIMA, GATE, and other related systems.
Principal themes include:

- issues and approaches in processing of very large data collections, e.g.,
 parallelisation and distribution (particularly in relation to cloud
 computing)

- sophisticated tools to build and manage complex processing pipelines
 and to analyse results

- software engineering in relation to language computation

- solutions to interoperability issues combining components from different
 sources (e.g., GATE, UIMA, NLTK, OpenNLP, NooJ)

- integration with related areas (data mining, semantic
 repositories, big table databases)

- persisting experimental contexts (computation and data), e.g. via
 virtualisation

- distribution of self-developed components, repositories of ready-to-use
 UIMA/GATE-based components

- efficient embedding of NLP processing in diverse environments (including
 small memory devices)

- research on genericity of components and type-system independence

- Service-Oriented Architectures (SOAs) and Software-as-a-Service
 (SaaS) models of language computation

- automatic feedback processes of knowledge discovery and reuse from text


INTENDED AUDIENCE

The workshop aims to bring together developers and users of NLP frameworks
from different perspectives, in order to elicit new requirements, feature
successful solutions, and exchange successful patterns of NLP engineering.
In particular, perspectives from the following user groups are welcome:

- Application Developers, from both research and industry,
 with application experience reports

- Framework Developers, with an NLP/software engineering background

- Researcher users of NLP architectures


SUBMISSION FORMATS

We solicit the following types of publications:

- full research papers (6-8 pages in LREC format)

- short papers (3-4 pages to be presented as demos/posters)

- open source tool/resource papers (full or short, must be accompanied by
 working code or accessible data)

Submissions will be handled through the START system. When submitting a
paper from the START page, authors will be asked to provide essential
information about resources (in a broad sense, i.e. also technologies,
standards, evaluation kits, etc.) that have been used for the work described
in the paper or are a new result of your research.  For further information
on this new initiative, please refer to
http://www.lrec-conf.org/lrec2010/?LREC2010-Map-of-Language-Resources.


IMPORTANT DATES

February 12, 2010 - Deadline for workshop papers
March 8, 2010 - Notification of acceptance
March 18, 2010 - Camera-ready papers due
May 22, 2010 - Workshop in Malta


ORGANISERS

Rene Witte, Concordia University, Montréal
Hamish Cunningham, University of Sheffield
Jon Patrick, University of Sydney
Elena Beisswanger, University of Jena
Ekaterina Buyko, University of Jena
Udo Hahn, University of Jena
Karin Verspoor, University of Colorado Denver
Anni R. Coden, IBM T.J. Watson Research Center


PROGRAM COMMITTEE

Aaron Kaplan (Xerox, France)
Adam Funk (Uni. Sheffield)
Angus Roberts (Uni. Sheffield)
Anni R. Coden (IBM T.J. Watson Research Center)
Claude Roux (Xerox Research Labs)
Diana Inkpen (Uni Ottawa)
Diana Maynard (Uni. Sheffield)
Dietmar Rösner (Uni. Magdeburg)
Dragan Gasevic (Uni. Athabasca)
Ekaterina Buyko (Uni. Jena)
Elena Beisswanger (Uni. Jena)
Epaminondas Kapetanios (Uni Westminster)
Eric W. Brown (IBM T.J. Watson Research Center)
Graham Wilcock (Uni. Helsinki)
Guergana K. Savova (Mayo Clinic)
Hamish Cunningham (Uni. Sheffield)
Horacio Saggion (Uni. Sheffield)
Iryna Gurevych (Uni. Darmstadt)
Jian Su (I2R, Singapore)
Jochen Leidner (Thomson Reuters)
Jon Patrick (Uni. Sydney)
Juergen Rilling (Concordia Uni, Montréal)
Kalina Bontcheva (Uni. Sheffield)
Kano Yoshinobu (Uni. Tokyo, Tsujii Lab)
Karin Verspoor (Uni. Colorado)
Katrin Tomanek (Uni. Jena)
Kevin B. Cohen (MITRE)
Leila Kosseim (Concordia Uni., Montréal)
Leo Ferres (Uni. of Concepcion)
Marc Light (Thomson Corp. R&amp;amp;D)
Michael Tanenblatt (IBM T.J. Watson Research Center)
Nancy Ide (Vassar College)
Nicolas Hernandez (Uni. Nantes)
Philip V. Ogren (Uni. Colorado)
Ralf Krestel (L3S Research Center, Hannover)
Rene Witte (Concordia Uni., Montréal)
Richard Eckart de Castilho (Uni. Darmstadt)
Sameer Pradhan (BBN)
Stefan Geißler (TEMIS GmbH)
Steven Bethard (Stanford Uni.)
Thilo Götz (IBM Germany)
Udo Hahn (Uni. Jena)
Valentin Tablan (Uni. Sheffield)
Yuntao Zhang (Shanghai Jiaotong Uni.)


&lt;/pre&gt;</description>
    <dc:creator>Ekaterina Buyko</dc:creator>
    <dc:date>2010-02-01T12:16:47</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2684">
    <title>AUTO: Igor Sominsky is out of the office (returning Mon 02/08/2010)</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2684</link>
    <description>&lt;pre&gt;

I am out of the office from Sun 01/31/2010 until Mon 02/08/2010.




Note: This is an automated response to your message  "Getting
non-annotations (e.g. TOP) from a CAS" sent on 2/1/10 3:50:12.

This is the only notification you will receive while this person is away.&lt;/pre&gt;</description>
    <dc:creator>Igor Sominsky</dc:creator>
    <dc:date>2010-02-01T09:01:13</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2682">
    <title>Getting non-annotations (e.g. TOP) from a CAS</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2682</link>
    <description>&lt;pre&gt;Hello folks,

after upgrading to UIMA 2.3.0, I notice that all (J)CAS access methods I found so far (including indexes) always return Annotation or AnnotationFS.

How can one get access now to types that directly inherit from TOP?

Cheers,

Richard

&lt;/pre&gt;</description>
    <dc:creator>Richard Matthias Eckart de Castilho</dc:creator>
    <dc:date>2010-02-01T08:50:12</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2666">
    <title>XMI XML XSD?</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2666</link>
    <description>&lt;pre&gt;I know that the XMI schema is not defined by UIMA, but does anyone happen  
to know of an XML schema definition (.xsd) file for XMI?  It might make  
parsing XMI XML easier.

Thanks,

Greg Holmberg

&lt;/pre&gt;</description>
    <dc:creator>Greg Holmberg</dc:creator>
    <dc:date>2010-01-29T18:44:31</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2665">
    <title>UIMA-AS binary serialization</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2665</link>
    <description>&lt;pre&gt;Hi UIMA users--

I see in the README for 2.3 that UIMA-AS uses a new, efficient binary  
serialization for remote services.

I couldn't find much information about it in the Async Scaleout docs.  It  
was briefly mentioned as a configuration option, but not described.

Is this the same format that is used to serialize to C++?

If not, where can I find more information?

Must the recipient re-constitute the CAS, or is it self-describing like  
XML and could be handled by a non-UIMA recipient?

Thanks,

Greg Holmberg


&lt;/pre&gt;</description>
    <dc:creator>Greg Holmberg</dc:creator>
    <dc:date>2010-01-29T18:40:34</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2658">
    <title>UIMA AS: Duplicate Request</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2658</link>
    <description>&lt;pre&gt;Hi,

there is this message in the service logs:
1/29/10 3:09:40 AM - 16: org.apache.uima.aae.handler.input.ProcessRequestHandler_impl.handleProcessRequestFromRemoteClient: INFO: Duplicate Request With Cas Reference Id: 2d0e21bb:12664a22384:7eac Received. Ignoring Duplicate.

What does it mean ? And how can it happen ?

Jörn

&lt;/pre&gt;</description>
    <dc:creator>Jörn Kottmann</dc:creator>
    <dc:date>2010-01-29T09:26:54</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2657">
    <title>[Announce] Apache UIMA 2.3.0 released</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2657</link>
    <description>&lt;pre&gt;The Apache UIMA development community is pleased to announce the release
of version 2.3.0 of UIMA (Unstructured Information Management
Architecture).  Apache UIMA is a framework supporting combining and
reusing components that annotate unstructured information content such
as text, audio, and video.

This release consists of 4 packages:

 - UIMA Java SDK - the base framework, with development tools and examples
 - UIMA-AS (Asynchronous Scaleout capability)
 - UIMACPP (C++ support framework, for components written in C++ and
other languages)
 - UIMA Addons - a growing set of annotators and other tools.

This release is generally backwards compatible with previous releases,
except that Java 5 is now the minimum Java level required.

The add-ons package contains many new components and annotators, including:

  - Bean Scripting Framework supporting annotators written in popular
scripting languages
  - Lucas - an interface to using UIMA with Apache Lucene
  - TikaAnnotator - an annotator using the Apache Tika project text
extractors

The UIMA-AS (Asynchronous Scaleout) framework is extensively enhanced
with much more support for error/failure recovery, driven by feedback
from actual use in several large scale deployments (1000's of nodes).
The base framework now supports Java 5 generics, and is enhanced to make
it even more light-weight and efficient; for example, it now supports a
new network serialization format for communicating with remote
annotators using a "delta-CAS" - limiting the response sent to just
those items which have changed.

Full information and summaries of the changes are contained in the
release notes, which you can find on our downloads page - scroll down to
the 2.3.0 release section, and click on the package of interest in the
release notes column.

Apache UIMA welcomes your help.  Any contribution (code, testing,
documentation, bug reporting/fixing) is always appreciated.  For more
information on how to get involved, please visit the website at:

  http://incubator.apache.org/uima

Thank you for your interest in Apache UIMA.

-The Apache UIMA development community



&lt;/pre&gt;</description>
    <dc:creator>Marshall Schor</dc:creator>
    <dc:date>2010-01-28T22:58:21</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2653">
    <title>AE parameters don't change</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2653</link>
    <description>&lt;pre&gt;Hi

I'm trying to change a parameter in an AE and then call the CPE that uses
this AE, but the change is not applied even though I get no
errors. Here is the part of the code I'm using:

ResourceSpecifier aeSpecifier;
try {
    aeSpecifier = UIMAFramework.getXMLParser().parseResourceSpecifier(
            new XMLInputSource("desc/TestSeePredAED.xml"));
    AnalysisEngine ae = UIMAFramework.produceAnalysisEngine(aeSpecifier);

    ConfigurationParameterSettings aesettings =
            ae.getMetaData().getConfigurationParameterSettings();
    NameValuePair[] valuePairs2 = aesettings.getParameterSettings();

    for (NameValuePair nvp2 : valuePairs2) {
        if (nvp2.getName().matches("OUTPUTDIR")) {
            nvp2.setValue("/home/radwen/Bureau/Fold");
        }
        System.out.format("name='%s'; value='%s'\n",
                nvp2.getName(), nvp2.getValue());
    }

    ae.reconfigure();


But when I call my CPE programmatically right after this code, the
changes do not seem to have been applied.

Can you help me figure this out?

Thx

Radwen
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-01-28T17:16:24</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2651">
    <title>Parameters and Java</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2651</link>
    <description>&lt;pre&gt;Hello,

When we set some parameters in an analysis engine descriptor, how do we
access these values in the (Java) annotator?

Thx
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-01-28T16:36:23</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2650">
    <title>Change programmatically parameters</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2650</link>
    <description>&lt;pre&gt;Hi,

I would like to know how to programmatically change a parameter that is
declared in the descriptor of an analysis engine.
I've set a mandatory parameter in the descriptor, but its value will depend on
the end user's choice, and I would like to let the user specify (through the
main class) a value that replaces the default.

How can we do this programmatically?

Thanks

Rad
&lt;/pre&gt;</description>
    <dc:creator>Radwen ANIBA</dc:creator>
    <dc:date>2010-01-28T15:33:44</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2649">
    <title>FeaturePath's built-in function "coveredText()"</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2649</link>
    <description>&lt;pre&gt;Hello,

I'd like to know how to use the built-in function "coveredText()" described in the
API of the FeaturePath interface? For example, I'd like to select annotations of
type uima.tcas.Annotation covering the text "UIMA". How can I do that?

Thanks in advance,
Jérôme

&lt;/pre&gt;</description>
    <dc:creator>Jérôme Rocheteau</dc:creator>
    <dc:date>2010-01-28T13:42:14</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2647">
    <title>Create a new parameter programmatically for aggregated analysis engine</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2647</link>
    <description>&lt;pre&gt;Hello again.

I want to share a new parameter with all annotators of all analysis engines 
inside an aggregated analysis engine. This should be done programmatically. The 
problem is that aggregated analysis engines can only override existing 
parameters. Is there any way around this?

I tried to directly add a ConfigurationParameter to 
ConfigurationParameterDeclaration of the aggregated analysis engine, but that 
doesn't seem to work (as no new parameters are allowed for the above reason).

Best regards,
Kai


&lt;/pre&gt;</description>
    <dc:creator>Kai Schlamp</dc:creator>
    <dc:date>2010-01-28T12:35:58</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2641">
    <title>Set parameters programmatically before creating the analysis engine</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2641</link>
    <description>&lt;pre&gt;Hello.

In my application the analysis engine parameters can only be set 
programmatically. This is normally done after the analysis engine is produced. 
After setting the parameters it is reconfigured. So far so good.
The problem is now that one specific analysis engine throws an exception when it 
is produced as it contains by default some invalid parameters.
Is there a way to specify the parameters of an analysis engine before it is 
produced?

URL descriptorUrl = bundle.getResource(descriptorPath);
XMLInputSource input = new XMLInputSource(descriptorUrl);
ResourceSpecifier resourceSpecifier =
    UIMAFramework.getXMLParser().parseResourceSpecifier(input);
AnalysisEngine analysisEngine =
    UIMAFramework.produceAnalysisEngine(resourceSpecifier);
--&amp;gt; here comes the ResourceInitializationException

Best regards,
Kai


&lt;/pre&gt;</description>
    <dc:creator>Kai Schlamp</dc:creator>
    <dc:date>2010-01-28T02:00:38</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2639">
    <title>CASImpl.createFilteredIterator performance / Eclipse source plugin</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2639</link>
    <description>&lt;pre&gt;Hello,

I have some performance issues with a current application. The profiler 
tells me that over 80% of the execution time is spent in about 200 
calls of the method CASImpl.createFilteredIterator(). That 80% is 
sometimes more than 1000 s for a single AE.process() call, even though a lot 
more index traversal happens within the remaining 20%.

I can't investigate the cause of this performance hot spot any further, 
partly because I am missing the source plugin for the UIMA runtime plugin. 
The application is running within Eclipse. My first question: Is there 
an easy way to get/create a source plugin for the UIMA core/runtime, 
ideally without using Maven? Any best practices for profiling UIMA in Eclipse?

My second question: Is this normal behavior, or can anyone give me a 
hint on how I could improve the performance?

Some exemplary information about the usage of the method:
The CAS contains about 40 pages of plain text with about 50 lines per 
page. Part of the text (maybe 3 pages) is annotated, and for each line of 
that segment the method createFilteredIterator() is called with some 
constraints on types and, of course, on the window of the iterator 
(that is, the line). I also tried to replace the filtered iterator with 
a window constraint by a filtered iterator over a subiterator of the 
annotation index, with no real improvement in performance. The 
UIMA version is 2.2.2.

Looking forward to any hints or directions.

Peter

&lt;/pre&gt;</description>
    <dc:creator>Peter Klügl</dc:creator>
    <dc:date>2010-01-27T12:37:30</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2635">
    <title>UIMA presentation in NYC - Feb 24</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2635</link>
    <description>&lt;pre&gt;Hello,

If you are in or near New York City on Feb 24, 2010, you may want to come to the UIMA presentation by Dr. Pablo Duboue (he's a subscriber here):

  http://www.meetup.com/NYC-Search-and-Discovery/calendar/12384559/


This is an excerpt from the blurb about Pablo's talk:

In this talk, I will briefly present UIMA basics before discussing full
UIMA systems I have been involved in the past (including our Expert
Search system in TREC Enterprise Track 2007). I will be talking about
how UIMA supported the construction of our custom NLP tools. I will
also sketch the new characteristics of the UIMA Asynchronous Scaleout
(UIMA AS) subproject that enable UIMA to run Analysis Engines in
thousands of machines.

Otis
--
Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch


&lt;/pre&gt;</description>
    <dc:creator>Otis Gospodnetic</dc:creator>
    <dc:date>2010-01-26T18:55:39</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2634">
    <title>How to process with aggregate descriptors</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2634</link>
    <description>&lt;pre&gt;Hi,
I'm trying to use an example aggregate descriptor 
(RoomNumberAndDateTime) using this simple code:

        XMLInputSource input = new XMLInputSource(
                "desc/analysis_engine/RoomNumberAndDateTime.xml");
        AnalysisEngineDescription desc =
                UIMAFramework.getXMLParser().parseAnalysisEngineDescription(input);

        AnalysisEngine seAnnotator = UIMAFramework.produceAnalysisEngine(desc);

        JCas cas = seAnnotator.newJCas();
        cas.setDocumentText("August 26, 2003 \n UIMA 101 - The New UIMA "
                + "Introduction \n (Hands-on Tutorial) \n 9:00AM-5:00PM in HAW GN-K35\n "
                + "August 28, 2003");

        seAnnotator.process(cas);

        FSIndex indexAnnotations = cas.getAnnotationIndex();
        Iterator&amp;lt;Annotation&amp;gt; annoIt = indexAnnotations.iterator();

If I iterate over annoIt, I find only annotations for RoomNumber but not 
for Date or Time (as the DocumentAnalyzer does). What am I doing wrong?

Thanks

Steve

&lt;/pre&gt;</description>
    <dc:creator>Steve Petruzza</dc:creator>
    <dc:date>2010-01-26T17:45:24</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/2632">
    <title>Block Selection in Document Analyzer</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/2632</link>
    <description>&lt;pre&gt;Hi,

I am new to UIMA and I have a question about the Document Analyzer.

Is it possible to highlight a rectangular region of text with
coordinates x0, y0, x1, y1, which might span multiple lines (y0-y1), where
x0-x1 covers only part of each line's text?
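
To make the question concrete, here is a minimal sketch of how such a
rectangle could be mapped to one character range per line, which could then
be highlighted like ordinary annotation spans. It is plain Java and not part
of UIMA or the Document Analyzer. The class BlockSelection, the method
blockToSpans, and the conventions used (0-based line numbers y0 to y1
inclusive, 0-based columns from x0 inclusive to x1 exclusive, lines separated
by \n, y0 not negative, x0 not larger than x1) are all assumptions made for
illustration.

```java
public class BlockSelection {

    /**
     * Maps a rectangular block to one {begin, end} character-offset pair per
     * line. Lines y0..y1 are inclusive, columns x0 (inclusive) to x1
     * (exclusive). Columns past the end of a short line are clipped.
     */
    static int[][] blockToSpans(String text, int y0, int y1, int x0, int x1) {
        String[] lines = text.split("\n", -1);

        // Offset of the first character of every line in the full text.
        int[] starts = new int[lines.length];
        int offset = 0;
        int i = 0;
        for (String line : lines) {
            starts[i++] = offset;
            offset += line.length() + 1; // +1 for the line separator
        }

        // Clip the requested line range to the lines that actually exist.
        int lastLine = Math.min(y1, lines.length - 1);
        int count = Math.max(0, lastLine - y0 + 1);

        int[][] spans = new int[count][];
        for (int row = 0; row != count; row++) {
            int y = y0 + row;
            int len = lines[y].length();
            int begin = starts[y] + Math.min(x0, len);
            int end = starts[y] + Math.min(x1, len);
            spans[row] = new int[] { begin, end };
        }
        return spans;
    }

    public static void main(String[] args) {
        String text = "abcdef\nghij\nklmnopq";
        // Lines 0..2, columns 2..5 of each line.
        for (int[] span : blockToSpans(text, 0, 2, 2, 5)) {
            System.out.println(span[0] + "-" + span[1] + ": "
                    + text.substring(span[0], span[1]));
        }
    }
}
```

Each resulting pair is an ordinary begin/end offset into the document text,
so a block covering N lines becomes N separate spans rather than one.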

Your answer is very much appreciated.

Thanks and Regards
Ram Mohan
&lt;/pre&gt;</description>
    <dc:creator>Ram Mohan Yaratapally</dc:creator>
    <dc:date>2010-01-25T11:36:58</dc:date>
  </item>
  <textinput rdf:about="http://search.gmane.org/?group=$group=gmane.comp.apache.uima.general">
    <title>Search Engine</title>
    <description>Search the mailing list at Gmane</description>
    <name>query</name>
    <link>http://search.gmane.org/?group=$group=gmane.comp.apache.uima.general</link>
  </textinput>
</rdf:RDF>
