<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:admin="http://webns.net/mvcb/">
  <channel rdf:about="http://blog.gmane.org/gmane.comp.lang.ruby.ferret.general">
    <title>gmane.comp.lang.ruby.ferret.general</title>
    <link>http://blog.gmane.org/gmane.comp.lang.ruby.ferret.general</link>
    <description/>
    <syn:updatePeriod>hourly</syn:updatePeriod>
    <syn:updateFrequency>1</syn:updateFrequency>
    <syn:updateBase>1901-01-01T00:00+00:00</syn:updateBase>
    <items>
      <rdf:Seq>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4528"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4526"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4525"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4524"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4522"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4521"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4519"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4518"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4517"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4516"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4515"/>
      </rdf:Seq>
    </items>
    <image rdf:resource="http://gmane.org/img/gmane-25t.png"/>
    <textinput rdf:resource=""/>
  </channel>
  <image rdf:about="http://gmane.org/img/gmane-25t.png">
    <title>Gmane</title>
    <url>http://gmane.org/img/gmane-25t.png</url>
    <link>http://gmane.org</link>
  </image>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4528">
    <title>(no subject)</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4528</link>
    <description>&lt;pre&gt;http://medcorpadaptions.com.au.tmp.anchor.net.au/family.php?gid=41&amp;amp;kj=57&amp;amp;fece=828&amp;amp;cj=18
&lt;/pre&gt;</description>
    <dc:creator>Amit</dc:creator>
    <dc:date>2011-12-12T08:09:45</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4526">
    <title>Searching from a Lucene created Index</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4526</link>
    <description>&lt;pre&gt;Hi,

I'm trying to open an index created by Lucene 1.9.1. I know the index has not been
corrupted; I can browse it with Luke, but when I try:
irb&amp;gt; Ferret::Index::IndexReader.new("./index'")

I get the following error message:
Ferret::FileNotFoundError: File Not Found Error occured at &amp;lt;except.c&amp;gt;:93 in xraise
Error occured in index.c:840 - sis_find_segments_file
couldn't find segments file


from (irb):7:in `initialize'
from (irb):7:in `new'
from (irb):7

The segments file is inside ./index, but something goes wrong.

Help is appreciated :)

Ferret::VERSION = 0.11.6

Thanks!
&lt;/pre&gt;</description>
    <dc:creator>Jerrie Union</dc:creator>
    <dc:date>2011-04-05T21:34:16</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4525">
    <title>Custom highlighter/match vector access?</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4525</link>
    <description>&lt;pre&gt;Hi everyone,

I know from the archives things have kinda slowed down on ferret and there's an effort ongoing with lucy, but I was wondering if anyone had discovered a way to enumerate the matches of a particular field in the document and get the offsets?

With what I'm trying to do, ferret will be indexing large portions of structured information, but I really don't want to store it all in the ferret index just to have highlighting.  My understanding (I'm still new at this) is that if you index and store the match offsets, you can do this without storing the full text of the field.

Ideally, what I'd like is to expose the contents of the C MatchRange structure as an array of Ruby hash objects so that I could then use those offsets in the actual data store to create my own highlighted extracts (or something along those lines).

Short of adding a hacked version of searcher_highlight to the C API to do this and creating a corresponding wrapped Ruby version, is there any way to get to this information right now&lt;/pre&gt;</description>
    <dc:creator>Andrew S. Townley</dc:creator>
    <dc:date>2011-02-23T13:57:42</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4524">
    <title>Apache Lucy invitation</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4524</link>
    <description>&lt;pre&gt;[[cross-posted from the ruby-forum list]]

Hi.

Back in 2006[0], Dave Balmain and Marvin Humphrey agreed to join forces
on their search projects, Ferret and KinoSearch (respectively), and
created the Lucy project at Apache.

Now it's 2010. Lucy is in the Apache Incubator.[1]

I'm a part of the Apache Lucy project, and I'd like to invite you to
become a part of it too.

The main goal of Apache Lucy is to provide the core C code for
language-specific implementations, like Ferret does for Ruby. Now's your
chance to help define what Apache Lucy looks like for Ruby.

Mailing list information at the Incubator site[1].

cheers,
Peter Karman


[0] http://www.perlmonks.org/?node_id=556317
[1] http://incubator.apache.org/lucy/


&lt;/pre&gt;</description>
    <dc:creator>Peter Karman</dc:creator>
    <dc:date>2010-11-21T17:23:20</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4522">
    <title>Ferret search engine as a daemon?</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4522</link>
    <description>&lt;pre&gt;_______________________________________________
Ferret-talk mailing list
Ferret-talk-GrnCvJ7WPxnNLxjTenLetw&amp;lt; at &amp;gt;public.gmane.org
http://rubyforge.org/mailman/listinfo/ferret-talk&lt;/pre&gt;</description>
    <dc:creator>Zakay Danial</dc:creator>
    <dc:date>2010-05-21T12:10:56</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4521">
    <title>Can i get a list of all unique indexed words?</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4521</link>
    <description>&lt;pre&gt;I have a requirement to provide a book-style browseable 'index' of all
our resources (which are already indexed with ferret).  I thought that a
nice simple way to do this would be to pull every unique indexed word
from the ferret index, so that when the user clicks on a word in the
index, I just do a regular ferret search using that word.

With this approach, the only work I need to do is to generate the list
of terms in the first place (and refresh it occasionally).  Is there a
way to pull this out of the ferret index somehow?  It doesn't have to
happen in real time; I could do it in a cron job and save the results to
a text file, or whatever.  So, I don't mind if it's a slow process.

Grateful for any advice - max
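
A plain-Ruby sketch of the term list described above, assuming the source
texts are still available to re-tokenize. It does not read the Ferret index
itself; unique_terms is a hypothetical helper, and its regex tokenizer is an
assumption that will not necessarily match the analyzer that built the index.

```ruby
require "set"

# Hypothetical helper, not part of Ferret: collects every unique lowercased
# word from a list of plain-text documents and returns them sorted.
def unique_terms(texts)
  terms = Set.new
  texts.each do |text|
    # Treat runs of ASCII letters, digits, and apostrophes as words.
    text.downcase.scan(/[a-z0-9']+/) { |word| terms.add(word) }
  end
  terms.to_a.sort
end

# Sample documents, made up for illustration.
docs = ["Ferret indexes text", "Text search with Ferret"]
puts unique_terms(docs)
```

The sorted list could be written to a text file from a cron job, and each
entry used as the query for a regular search when the user clicks on it.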
&lt;/pre&gt;</description>
    <dc:creator>Max Williams</dc:creator>
    <dc:date>2009-12-08T14:54:30</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4519">
    <title>Problem with case sensitivity</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4519</link>
    <description/>
    <dc:creator>Max Williams</dc:creator>
    <dc:date>2009-11-26T11:13:12</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4518">
    <title>Invitation to connect on LinkedIn</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4518</link>
    <description/>
    <dc:creator>Santoshkumar Patil</dc:creator>
    <dc:date>2009-11-13T19:51:30</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4517">
    <title>How do I index very large file?</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4517</link>
    <description>&lt;pre&gt;Hello all, I'm building a monitoring application
that fetches an application's log file and indexes it.
How do I index a very large file,
up to several GB? The application
may write a very large log file in a short time.
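
One possible approach, sketched in plain Ruby under the assumption that each
log line can be indexed as its own small document: read the file lazily in
batches so it is never loaded whole. each_line_batch is a hypothetical helper,
not part of Ferret, and the batch size of 1000 is an arbitrary default.

```ruby
require "tempfile"

# Hypothetical helper, not part of Ferret: reads a file lazily and yields
# batches of [line_number, text] pairs, so the whole file never sits in memory.
def each_line_batch(path, batch_size = 1000)
  File.foreach(path).with_index(1).each_slice(batch_size) do |batch|
    yield batch.map { |text, number| [number, text.chomp] }
  end
end

# Demo on a small temporary file; a real log would be read the same way.
Tempfile.create("sample_log") do |file|
  file.write("first\nsecond\nthird\n")
  file.flush
  each_line_batch(file.path, 2) { |batch| p batch }
end
```

Each pair in a batch could then be added to the index as one document, with
the line number kept as a field so that a line can be identified later.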
&lt;/pre&gt;</description>
    <dc:creator>femto Zheng</dc:creator>
    <dc:date>2009-08-18T07:30:30</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4516">
    <title>Can't remove duplicate</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4516</link>
    <description>&lt;pre&gt;Hello all, I can't remove duplicates. I'm using Ferret
to index log files in order to monitor application activity.
What I want to do is index data based on the uniqueness of
[filename,line] (actually it should be [host,filename,line]).
The code is as follows:

if !$indexer

      field_infos = Ferret::Index::FieldInfos.new(:index =&amp;gt;
:untokenized_omit_norms,
                                   :term_vector =&amp;gt; :no)
      field_infos.add_field(:content, :store =&amp;gt; :yes, :index =&amp;gt; :yes)

      $indexer = Ferret::I.new(:path =&amp;gt; index_dir,
                               :field_infos =&amp;gt; field_infos,
                               :key =&amp;gt; [:filename, :line],
                               :max_buffered_docs=&amp;gt;100)

      #$indexer ||= Ferret::I.new(:path=&amp;gt;index_dir, :key =&amp;gt; ['filename',
'line'], :max_buffered_docs=&amp;gt;100) #unique host,file_name,line
      #$indexer.field_infos.add_field(:time,
      #                               #:default_boost =&amp;gt; 20,
      #                               :store =&amp;gt; :yes,
      #               &lt;/pre&gt;</description>
    <dc:creator>femto Zheng</dc:creator>
    <dc:date>2009-08-18T06:27:15</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4515">
    <title>Ferret Usability</title>
    <link>http://comments.gmane.org/gmane.comp.lang.ruby.ferret.general/4515</link>
    <description>&lt;pre&gt;Guys,

I am new to Ferret, and I have mixed feelings about it. On one hand,
I really like the simplicity of the system: it's easy to deploy and
use, and I have a lot of choices for integration, from aaf to rolling my
own, which isn't too hard either. On the other hand, I have heard a lot of horror
stories, from index corruption to segfaults. The most classic thread
I can find is here:

http://groups.google.com/group/rubyonrails-deployment/browse_thread/thread/980fe7cb20cb97dd

Even Ezra@EY is basically saying Ferret is unusable. May I ask what
the situation is now? Can anyone pin down what actually happened with
their segfaults/index corruption?
&lt;/pre&gt;</description>
    <dc:creator>Yeung William</dc:creator>
    <dc:date>2009-08-18T03:41:35</dc:date>
  </item>
  <textinput rdf:about="http://search.gmane.org/?group=$group=gmane.comp.lang.ruby.ferret.general">
    <title>Search Engine</title>
    <description>Search the mailing list at Gmane</description>
    <name>query</name>
    <link>http://search.gmane.org/?group=$group=gmane.comp.lang.ruby.ferret.general</link>
  </textinput>
</rdf:RDF>

