On Wed 4 May 2022, at 20:01, Gareth Evans <[log in to unmask]> wrote:
> Hi Quentin,
>
> What do you mean by "Windows search aspect"? If not too similar,
> docfetcher may be worth a look.
>
> I'm aware of various graphical and command line tools, none of which
> seem to do everything one might hope, but all have their uses.
>
> See for example:
>
> https://seekfast.org/blog/search-text-in-documents/how-to-search-text-in-multiple-files-in-linux/
>
> Contrary to the article's assertions, however, docfetcher allows
> case-insensitive searching,
> ... and allows search phrases if enclosed in
> speech marks (eg. "Bob Smith" not Bob Jones and John Smith)
In testing, I find PDF files appear in search results if they contain quoted phrases, but (Linux) PDF viewers' "Find..." doesn't find the relevant text (which exists) if the text concerned breaks over a line (eg in a column - not a hard "line break" as such.) That seems odd.
There is some query syntax help in docfetcher's in-program help, which says the underlying search engine is Apache Lucene, so it should work along these lines:
https://lucene.apache.org/core/3_4_0/queryparsersyntax.html
Best wishes,
G
>
> I use the version from snap on Debian 11. The only problem I can find
> is that .7z files are seen as empty, so are not indexed.
>
> Helpful error reports at bottom of window, eg. permissions, empty files etc.
>
> More info:
> http://docfetcher.sourceforge.net/en/index.html
>
> Pro version with more features and "fewer bugs"
> https://docfetcherpro.com/
>
> Not sure if it works over network/mapped drives - may need to be run on
> the server concerned.
>
> Hope that helps.
> Gareth
>
>
>
> On Wed 4 May 2022, at 10:01, Quentin Tucker
> <[log in to unmask]> wrote:
>> Hi,
>>
>> We are looking for tools/software to search Linux based file systems to
>> try and by-pass the windows search aspect in drives with large volumes
>> of mixed data.
>>
>> Could anyone recommend something please?
>>
>> Thanks
>>
>> Q Tucker
>>
>> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>> An archive of messages is stored permanently at
>> http://www.jiscmail.ac.uk/lists/data-protection.html
>>
>> If you wish to leave this list please send an email to
>> [log in to unmask] with no subject
>> and leave data-protection as the message body. This is an automated
>> service.
>>
>> Additional subscriber help is available at
>> https://www.jiscmail.ac.uk/help/subscribers.html
>>
>> Any other list queries can be emailed to [log in to unmask]
>>
>> For general JISCMail queries please email [log in to unmask]
>>
>> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> An archive of messages is stored permanently at
> http://www.jiscmail.ac.uk/lists/data-protection.html
>
> If you wish to leave this list please send an email to
> [log in to unmask] with no subject
> and leave data-protection as the message body. This is an automated
> service.
>
> Additional subscriber help is available at
> https://www.jiscmail.ac.uk/help/subscribers.html
>
> Any other list queries can be emailed to [log in to unmask]
>
> For general JISCMail queries please email [log in to unmask]
>
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
An archive of messages is stored permanently at http://www.jiscmail.ac.uk/lists/data-protection.html
If you wish to leave this list please send an email to [log in to unmask] with no subject
and leave data-protection as the message body. This is an automated service.
Additional subscriber help is available at https://www.jiscmail.ac.uk/help/subscribers.html
Any other list queries can be emailed to [log in to unmask]
For general JISCMail queries please email [log in to unmask]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|