#duraspace IRC Log


IRC Log for 2017-10-24

Timestamps are in GMT/BST.

[6:30] -wolfe.freenode.net- *** Looking up your hostname...
[6:30] -wolfe.freenode.net- *** Checking Ident
[6:30] -wolfe.freenode.net- *** Found your hostname
[6:30] -wolfe.freenode.net- *** No Ident response
[6:30] * DuraLogBot (~PircBot@webster.duraspace.org) has joined #duraspace
[6:30] * Topic is 'Welcome to DuraSpace IRC. This channel is used for formal meetings and is logged - http://irclogs.duraspace.org/'
[6:30] * Set by tdonohue on Thu Sep 15 17:49:38 UTC 2016
[12:14] * mhwood (~mhwood@mhw.ulib.iupui.edu) has joined #duraspace
[13:03] * tdonohue (~tdonohue@dspace/tdonohue) has joined #duraspace
[13:30] <DSpaceSlackBot> <sven> I like the fact the OAI-Webapp does not query the database for performance reasons.
[13:30] <DSpaceSlackBot> <mwood> Someone has benchmarked this?
[13:42] <DSpaceSlackBot> <tom_desair> SOLR is good for fuzzy matching or when doing searches on fields that are hard to index in the database (e.g. metadatavalue `text_value` for example). But since OAI only does queries on identifiers or set IDs, I think the database can even be faster than SOLR.
[13:42] <DSpaceSlackBot> <sven> Actually i dont know, im satisfied with the OAI performance ..
[13:44] <DSpaceSlackBot> <sven> Also OAI responses are cached
[13:45] <DSpaceSlackBot> <tom_desair> Database responses are also cached (in DSpace 6) ;)
[13:45] <DSpaceSlackBot> <sven> ofc
[13:45] <DSpaceSlackBot> <sven> :slightly_smiling_face:
[13:47] <DSpaceSlackBot> <bollini> I have seen a complete different figure where solr overcome postgres also for easy query on an indexed column. Unfortunately I don't have any record about the test that we make but it was something like 1M rows (very simple structure) with the DOI as unique id were queried by doi 10 times faster on solr than postgres with a default installation
[13:48] <DSpaceSlackBot> <sven> I just wanted to note , that performance is a big point especially in OAI webapp
[13:48] <DSpaceSlackBot> <bollini> this post also seems to validate my finding https://stackoverflow.com/questions/10053050/why-is-solr-so-much-faster-than-postgres
[13:48] <kompewter> [ performance - Why is Solr so much faster than Postgres? - Stack Overflow ] - https://stackoverflow.com/questions/10053050/why-is-solr-so-much-faster-than-postgres
[13:54] <DSpaceSlackBot> <sven> For example our search index is ~600MB the oai index only 80MB
[13:54] <DSpaceSlackBot> <mwood> Notice the bit about the 4400x improvement in Pg query performance due to paying attention to how it was being used.
[13:54] <DSpaceSlackBot> <sven> :slightly_smiling_face:
[14:03] <DSpaceSlackBot> <sulfrian> @sven The search index contains the full text index of the metadata but also of all the files, but the oai index only contain the metadata. So it is reasonable, that the search index is larger.
[15:13] * dyelar (~dyelar@dyelar.mrb.ku.edu) has joined #duraspace
[20:02] * mhwood (~mhwood@mhw.ulib.iupui.edu) Quit (Remote host closed the connection)
[21:41] * tdonohue (~tdonohue@dspace/tdonohue) has left #duraspace
[22:27] * dyelar (~dyelar@dyelar.mrb.ku.edu) Quit (Quit: Leaving.)

These logs were automatically created by DuraLogBot on irc.freenode.net using the Java IRC LogBot.