<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:admin="http://webns.net/mvcb/">
  <channel rdf:about="http://blog.gmane.org/gmane.comp.apache.uima.general">
    <title>gmane.comp.apache.uima.general</title>
    <link>http://blog.gmane.org/gmane.comp.apache.uima.general</link>
    <description/>
    <syn:updatePeriod>hourly</syn:updatePeriod>
    <syn:updateFrequency>1</syn:updateFrequency>
    <syn:updateBase>1901-01-01T00:00+00:00</syn:updateBase>
    <items>
      <rdf:Seq>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4141"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4137"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4129"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4128"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4127"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4122"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4117"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4115"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4113"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4109"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4108"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4101"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4097"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4082"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4079"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4069"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4065"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4063"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4061"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.apache.uima.general/4058"/>
      </rdf:Seq>
    </items>
    <image rdf:resource="http://gmane.org/img/gmane-25t.png"/>
    <textinput rdf:resource=""/>
  </channel>
  <image rdf:about="http://gmane.org/img/gmane-25t.png">
    <title>Gmane</title>
    <url>http://gmane.org/img/gmane-25t.png</url>
    <link>http://gmane.org</link>
  </image>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4141">
    <title>UTF8 Encoded documents processing</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4141</link>
    <description>&lt;pre&gt;dear all
I have Unicode document I want to process.
Following the tutorial at
this&amp;lt;http://www.ibm.com/developerworks/webservices/tutorials/ws-uima/&amp;gt;,
the code stucks at the last line.

                        File taeDescriptor = new
File("desc\\DateAnnotatorAEDescriptor.xml");
File inputFile = new File("data\\document1.txt");
XMLInputSource in = new XMLInputSource(taeDescriptor);
ResourceSpecifier specifier =
UIMAFramework.getXMLParser().parseResourceSpecifier(in);
AnalysisEngine tae = UIMAFramework.produceAnalysisEngine(specifier);
CAS tcas = tae.newCAS();
FileInputStream fis = new FileInputStream(inputFile);
byte[] contents = new byte[(int) inputFile.length()];
fis.read(contents);
fis.close();
String document = new String(contents);
tcas.setDocumentText(document);
*tae.process(tcas);*

thank you.
&lt;/pre&gt;</description>
    <dc:creator>Seid Muhie</dc:creator>
    <dc:date>2012-05-26T21:13:10</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4137">
    <title>Test Uima</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4137</link>
    <description>&lt;pre&gt;We are interested installing and using UIMA as an entity extraction
engine. Before we do that, I'd like to test the system to see the
results. Could I send 25-100 documents , name your format, to someone to
put through their UIMA engine to test. Results sent back to me to
review. It would be greatly appreciated. Thanks!

 

Burt Buchman

Solution Architect - Editorial

EBSCO Publishing

Phone: 978-356-6500, ext 2346

bbuchman-TsyAZbWhrljby3iVrkZq2A&amp;lt; at &amp;gt;public.gmane.org

 

&lt;/pre&gt;</description>
    <dc:creator>Burt Buchman</dc:creator>
    <dc:date>2012-05-23T19:03:09</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4129">
    <title>Maven ERROR</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4129</link>
    <description>&lt;pre&gt;Hi

         I read UIMA Overview &amp;amp; SDK Setup ,and try to download UIMA with
maven, 

but ULR is ERROR, it says that
http://www.apache.org/dist/uima/eclipse-update-site/ , something is lost and
could not found, how to deal with it,

         

         could anyone help me?

 

------------

Tian

 

 

&lt;/pre&gt;</description>
    <dc:creator>tgh</dc:creator>
    <dc:date>2012-05-24T01:01:30</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4128">
    <title>Recognize patterns from samples</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4128</link>
    <description>&lt;pre&gt;Hello all,
I am new to this framework, and to this topic in general.
My requirement is to build a component that can take unstructured html
documents, and extract data. Something like this can be built with a regular
html parser.
However due the number of different html document types, building something
like this by hand is a time consuming, especially if there is a way to generate
a parser automatically from training data and apply incremental learning as new
samples are proven valid. 
Many of the documents I am looking to structure and extract data from, contains
financial data (currency, numbers, dates and times .. etc).

So my first question is, Can UIMA help ? I did some reading about opennlp, and
got lost, which one is closer to what I need if any.

Thank you a lot, for your time.



&lt;/pre&gt;</description>
    <dc:creator>Mansour</dc:creator>
    <dc:date>2012-05-18T07:06:35</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4127">
    <title>Testing UIMA</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4127</link>
    <description>&lt;pre&gt;Hi all:

We are interested in testing UIMA and wondering if there is any site
available for that? Or would we have to install it to then test it out?
Would appreciate any insight. Thanks!

 

Burt Buchman

Solution Architect - Editorial

EBSCO Publishing

Phone: 978-356-6500, ext 2346

bbuchman-TsyAZbWhrljby3iVrkZq2A&amp;lt; at &amp;gt;public.gmane.org

 

&lt;/pre&gt;</description>
    <dc:creator>Burt Buchman</dc:creator>
    <dc:date>2012-05-16T14:20:02</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4122">
    <title>uimaFIT 1.3.1 - AnalysisEngineFactory.setConfigurationParameters deprecated</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4122</link>
    <description>&lt;pre&gt;Hi,

Since version 1.3.1 of uimaFIT
org.uimafit.factory.AnalysisEngineFactory.setConfigurationParameters is
marked as deprecated. What should I use instead?

Regards,

Armin

&lt;/pre&gt;</description>
    <dc:creator>Armin.Wegner-w0Tf80GBFXAb1SvskN2V4Q&lt; at &gt;public.gmane.org</dc:creator>
    <dc:date>2012-05-15T11:04:10</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4117">
    <title>Maven UIMA and import by name</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4117</link>
    <description>&lt;pre&gt;Hello all,

I have a question on how you deal with a specific use case and would like to know if you have some suggestions for me.

I use Maven for all my Java projects and so I do for my UIMA related projects. Now I have a quite large pipeline with lots of descriptors. They reside in (or subdirectories of) the 'desc' directory of the 'UIMA nature' structure.
Currently I am about to pack these single-AE descriptors into aggregates. For importing all single-AEs into the AAE descriptor, I would like to use "import by name". However, the 'desc' directory is not a library for eclipse and thus, the AAE descriptor editor doesn't list the descriptors residing in this directory - I can't add them (and when I edit the XML, I get error messages about descriptors not found).

I would like to just add the 'desc' directory to the build path as an "class folder" (not a source folder, this won't work), i.e. as a library. When I do this manually, Maven would overwrite it the next time it updates my project configuration.

H&lt;/pre&gt;</description>
    <dc:creator>Erik Fäßler</dc:creator>
    <dc:date>2012-05-11T15:56:09</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4115">
    <title>How to load XMI CAS from a DB inside a Reader AE?</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4115</link>
    <description>&lt;pre&gt;Hi all,

I have a database which contains XMIs and would like to process
these with UIMA-AS.

My pipeline looks like this:
Database Reader AE
Result Writer AE

I made an AAE composed out of the two AEs, the first gets a CAS with an
ID encoded into its sofa. It then fetches the XMI from the DB, and calls
XmiCasDeserializer.deserialize to load it into the CAS.
After that it inserts an ID FeatureStructure into the CAS (via createFS 
and addFsToIndexes).

The second AE should just access the content of the CAS, but the ID FS added
by the first AE is missing. Why is it missing? (input/output 
capabilities are not specified)
The content which is in the XMI can be accessed.

Is it safe to use XmiCasDeserializer.deserialize? Or should it be done
in some other way?

Jörn

&lt;/pre&gt;</description>
    <dc:creator>Jörn Kottmann</dc:creator>
    <dc:date>2012-05-08T17:03:31</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4113">
    <title>Submission Deadline Extension</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4113</link>
    <description>&lt;pre&gt;we apologize if you receive multiple copies of this CFP

===================================================================

CALL FOR PAPERS

7th Workshop on

Virtualization in High-Performance Cloud Computing

VHPC '12

as part of Euro-Par 2012, Rhodes Island, Greece

===================================================================

Date: August 28, 2012

Workshop URL: http://vhpc.org

SUBMISSION DEADLINE:

June 11, 2012 - Full paper submission (extended)


SCOPE:

Virtualization has become a common abstraction layer in modern
data centers, enabling resource owners to manage complex
infrastructure independently of their applications. Conjointly,
virtualization is becoming a driving technology for a manifold of
industry grade IT services. The cloud concept includes the notion
of a separation between resource owners and users, adding  services
such as hosted application frameworks and queueing. Utilizing the
same infrastructure, clouds carry significant potential for use in
high-performance scientific com&lt;/pre&gt;</description>
    <dc:creator>VHPC 12</dc:creator>
    <dc:date>2012-05-06T17:39:44</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4109">
    <title>obtaining xmi:id from tag.</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4109</link>
    <description>&lt;pre&gt;Hello,
I'm having difficulty extracting the xmi:id attribute value from each of my 
elements in my xmi file.  Assuming there are xmi:id attribute values, how do I 
extract it's value from the method below

public void processCas(CAS aCAS) throws ResourceProcessException {
    ???
}

James


&lt;/pre&gt;</description>
    <dc:creator>James Moliere</dc:creator>
    <dc:date>2012-05-03T17:05:32</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4108">
    <title>Advice for running UIMA AS as Windows Apache Procrun Service</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4108</link>
    <description>&lt;pre&gt;Hello!
   I would like to use the Apache Procrun project (
http://commons.apache.org/daemon/procrun.html) to run the UIMA AS pipeline
as Windows Service. :
I.A first solution is to Try is to modify the current UIMA_Service class
and to adapt it to the Apache Procrun requirements (adding start and stop
methods) and use this altered class as Java Class for the Windows Service.
     But, looking up to the current implementation, I've noticed that the
UIMA_Service is launched by org.apache.uima.bootstrap.UimaBootstrap class.
     I'm wondering if this launching part is a requirement of the UIMA AS
framework. Can I skip this step and launch directly the My_UIMA_Service
main class?


  Can you please advice me, how I can implement my use case according to
the UIMA AS specifications?

II. A second solution is trying to use somehow the  current starting script
deployAsyncService.cmd to launch it as windows service.But the following
issues raised:
   1. Is this solution feasible? The script starts the UIMA in command&lt;/pre&gt;</description>
    <dc:creator>Spico Florin</dc:creator>
    <dc:date>2012-05-03T13:57:46</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4101">
    <title>how to get the original filename of the input document?</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4101</link>
    <description>&lt;pre&gt;Hello

I did my first aggregate AE with several Annotators. All works fine but, now, I
need the original filename (or filepath) of the input Document. I did several
tries with getSofaDataURI() = but it is null, and other Annotations in the JCas
but without success. While I can with success obtain mime-type, language, etc...
I did a xml serialization of the JCas and I can see the filepath in a
&amp;lt;string&amp;gt;...document.txt&amp;lt;/string&amp;gt; tag. How can I access it?

many thanks
Michelangelo


&lt;/pre&gt;</description>
    <dc:creator>michelangelo</dc:creator>
    <dc:date>2012-05-01T21:32:58</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4097">
    <title>Synchronizing the initialization of a component in an aggregate</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4097</link>
    <description>&lt;pre&gt;Hi All,

I am trying to write a primitive analysis engine that checks, and
creates or updates a database schema based on the type system. I need
to synchronize the initialization of this component so that only one
instance of the component will do this task when there are multiple
instances being instantiated.

What is the correct object to synchronize on?  Is the type system
object the correct one and does it maintain its identity throughout a
JVM run?  Is it a different object in the other aggregates even though
they use the same type system description?

I need to block all other threads in the other instances of the
current component until the database is updated.  I also need this
object to be specific to the current aggregate so that other
aggregates running in the same JVM can have their own synchronization
objects and database updates independent of each other. In other
words, I can't use a JVM wide object.

Thank you,
Shahim

&lt;/pre&gt;</description>
    <dc:creator>Shahim Essaid</dc:creator>
    <dc:date>2012-04-30T18:01:50</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4082">
    <title>Berlin Buzzwords program is online</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4082</link>
    <description>&lt;pre&gt;This is to announce the Berlin Buzzwords program. The Program Committee has 
completed reviewing all submissions and set up the schedule containing a great 
lineup of speakers for this years Berlin Buzzwords program. Among the speakers 
we have Leslie Hawthorn (Red Hat), Alex Lloyd (Google), Michael Busch (Twitter) 
as well as Nicolas Spiegelberg (Facebook). Checkout our program at 
http://berlinbuzzwords.de/program/session-schedule 

Berlin Buzzwords standard conference tickets are still available. Note that we 
also offer a special rate for groups of 5 and more attendees with a 15% discount 
off the standard ticket price. 

“Berlin Buzzwords is by far one of the best conferences around if you care about 
search, distributed systems, and NoSQL...” says Shay Banon, founder of 
ElasticSearch. 

Berlin Buzzwords will take place June 4th and 5th 2012 at Urania Berlin 
(http://www.uraniaberlin.de). The 3rd edition of the conference for developers 
and users of open source projects, again focuses on everythin&lt;/pre&gt;</description>
    <dc:creator>Isabel Drost</dc:creator>
    <dc:date>2012-04-26T11:22:58</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4079">
    <title>Repackaging an unpackaged pear file</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4079</link>
    <description>&lt;pre&gt;I received a copy of an application that works with UIMA a few weeks ago from
some colleagues at another location. When I followed the instructions to
install it, I got an error message while unpacking a pear file, and it looks
like an XML file within it contains some hard-coded pathnames to a machine at
the organization that sent our colleagues the application originally. I could
ask them to get in touch with the organization and ask them to recreate the
pear file with relative pathnames so it can be installed on machines on other
networks, and I probably will do that. But I was wondering how hard it would be
just to correct the pathnames, re-package the pear file, and reinstall that
one. I have never worked with UIMA before, so I am learning the basics as I go.
How complicated would it be to create an Eclipse project using the directory
structure that the pear file expanded to, or to run a command line application
that creates a pear file from that directory structure?
Thanks,
Mike


&lt;/pre&gt;</description>
    <dc:creator>Mike O'Leary</dc:creator>
    <dc:date>2012-04-25T17:15:27</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4069">
    <title>CAS Editor instructions</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4069</link>
    <description>&lt;pre&gt;I've been using UIMA for years and years, but I am completely stumped
trying to run the CAS Editor.  Is anyone expert enough to provide an
actual step-by-step guide for getting the CAS Editor to appear, given
an existing type system and annotated XMI?

I have tried everything in the instructions, but keep getting error
messages.  First, missing type system.  After RENAMING and putting
in the root of project, complaint that document is not in a corpus
folder.  Explanation how to create a corpus folder is vague and
confusing.

I guess I'm hoping for STEP BY STEP instructions of exactly what to
click and type in Eclipse, and what should happen at each stage, as
far as expected output.  Discouragingly, nothing I try looks like the pictures
in the UIMA Tools Guide and Reference.

What I'm really hoping for are instructions so clear that 'even faculty' can
follow them to add annotations to documents, provided the documents and
type system are already set up. :)

&lt;/pre&gt;</description>
    <dc:creator>Eric Riebling</dc:creator>
    <dc:date>2012-04-20T20:08:48</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4065">
    <title>SemanticSearchCasIndexer.xml missing</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4065</link>
    <description>&lt;pre&gt;Hi,
I was trying to follow section
*3.5.1.2. Building and Running a CPE including the Semantic Search CAS
Indexer*
**
but I was unable to locate the CAS Consumers SemanticSearchCasIndexer.xml
at
%UIMA_HOME%/examples/descriptors/cas_consumer/SemanticSearchCasIndexer.xml

I need the SemanticSearchCasIndexer.xml. Can you guide me  how to get it or
if it can be downloaded and from where?

Thanks and Regards,
Mohammad Mazharuddin
&lt;/pre&gt;</description>
    <dc:creator>mazhar</dc:creator>
    <dc:date>2012-04-19T11:33:45</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4063">
    <title>CfP 7th Workshop on Virtualization in High-Performance Cloud Computing (VHPC'12)</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4063</link>
    <description>&lt;pre&gt;we apologize if you receive multiple copies of this CFP.

===================================================================

CALL FOR PAPERS

7th Workshop on

Virtualization in High-Performance Cloud Computing

VHPC '12

as part of Euro-Par 2012, Rhodes Island, Greece

===================================================================

Date: August 28, 2012

Workshop URL: http://vhpc.org

SUBMISSION DEADLINE:

Rolling abstract submission
June 4, 2012 - Full paper submission


SCOPE:

Virtualization has become a common abstraction layer in modern
data centers, enabling resource owners to manage complex
infrastructure independently of their applications. Conjointly,
virtualization is becoming a driving technology for a manifold of
industry grade IT services. The cloud concept includes the notion
of a separation between resource owners and users, adding  services
such as hosted application frameworks and queueing. Utilizing the
same infrastructure, clouds carry significant potential for use in
high-performan&lt;/pre&gt;</description>
    <dc:creator>VHPC 12</dc:creator>
    <dc:date>2012-04-19T11:46:20</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4061">
    <title>logging with UIMA</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4061</link>
    <description>&lt;pre&gt;Hi,

I am new to UIMA.

I am using the UIMA Simple Server configured in a tomcat web application.
http://uima.apache.org/d/uima-addons-current/SimpleServer/simpleServerUserGuide.html

When I go to the web page using this url:

http://localhost:8080/uimaweb/uima?mode=form

I can enter some text.   I replaced the  WhitespaceTokenizer.pear with
another tokenizer (the roomNumberAnnotator.pear from another
tutorial).

I press submit and response &amp;lt;result&amp;gt; &amp;lt;/result&amp;gt; just contains the raw
original text.

I have tried to put either a logging.properties or log4j.properties
into my web app's WEB-INF/classes and set the level to SEVERE or
DEBUGGING.   but I dont see the logs entries made by UIMA.

This java logging is confusing too.

How do you generate logs in a tomcat web app with UIMA?



### this is the log4j.properties   ### below this is the logging.properties.
# Root logger option
log4j.rootLogger=DEBUG, stdout

# Direct log messages to stdout
log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.&lt;/pre&gt;</description>
    <dc:creator>Sean K</dc:creator>
    <dc:date>2012-04-19T00:53:27</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4058">
    <title>UIMA Annotator Library</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4058</link>
    <description>&lt;pre&gt;Hello:

I have studied the User's Guide on creating Annotators, testing them and
deploying them with a CPE.

I wonder if there are UIMA annotator Libraries out on the Web for common
annotation such as names, locations, phone numbers etc.

Thanks

Jeremy
&lt;/pre&gt;</description>
    <dc:creator>Jeremy Villalobos</dc:creator>
    <dc:date>2012-04-14T08:34:37</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.apache.uima.general/4057">
    <title>CAS Editor: Sentence Detector shows strange suggestions...</title>
    <link>http://comments.gmane.org/gmane.comp.apache.uima.general/4057</link>
    <description>&lt;pre&gt;Guess this one is for you Jörn ;-)

Can I use the trunk version of the sentence detector tool?

I have downloaded and built the entire uimaj trunk. Configured Eclipse and
have no problems annotating and building models. However the "new" sentence
suggesting view does not seem to work correctly. First I annotate
headlines, subheadlines and paragraphs. The sentence detector is configured
to use my paragraph type and exclude headline and subheadline.

While annotating the suggestions gets worse and worse... The sentences are
repeated... and does not seem to be able to look further if it encounters a
subheadline, because there are not sentence suggestions after the first
subheadline even though there are more paragraphs.

thx,
Peter Thygesen

(also using opennlp-tools 1.5.3-SNAPSHOT, opennlp-uima 1.5.3-SNAPSHOT,
opennlp-maxent.3.0.3-SNAPSHOT is this is important)
&lt;/pre&gt;</description>
    <dc:creator>Peter Thygesen</dc:creator>
    <dc:date>2012-04-13T20:06:49</dc:date>
  </item>
  <textinput rdf:about="http://search.gmane.org/?group=$group=gmane.comp.apache.uima.general">
    <title>Search Engine</title>
    <description>Search the mailing list at Gmane</description>
    <name>query</name>
    <link>http://search.gmane.org/?group=$group=gmane.comp.apache.uima.general</link>
  </textinput>
</rdf:RDF>

