Beskjeder

Publisert 15. mai 2017 08:06

Ingen forelesning idag.

Publisert 9. mai 2017 16:05

Gamle eksamensoppgaver fra 2011-2016 finner dere her

Det er ikke tillatt med noen hjelpemidler på årets eksamen. Ingen bok, ingen notater, ingen kalkulator. Samme opplegg som alle foregående år, med andre ord.

Dere vil kunne besvare eksamen enten på engelsk eller norsk. Bruk det språket dere er mest komfortable med.

Publisert 9. mai 2017 15:16

Pensumliste:

  • Læreboka, minus kapitlene 11-12 og 16-18.
  • Alle slides og supplerende artikler brukt i undervisningen. Se også forelesningsplanen.
  • Alle øvingsoppgaver samt materiell brukt i gjennomføringen av øvingsopplegget.

Gamle eksamensoppgaver (med løsningsskisser) gjøres tilgjengelige snart.

Publisert 4. mai 2017 09:43

Gruppetimen i dag (04.05) er dessverre avlyst grunnet sykdom. Eventuelle spørsmål kan sendes på e-post eller Piazza. 

Publisert 20. apr. 2017 12:26

We have been made aware of an error in the test "testWeScienceFuzzy()":

Original:
assertTrue(firstResult.getDocument().getOriginalData().startsWith("[10340610]") || firstResult.getDocument().getOriginalData().startsWith("10840450"));

Correct:
assertTrue(firstResult.getDocument().getOriginalData().startsWith("[10340610]") || firstResult.getDocument().getOriginalData().startsWith("[10840450]"));

The mistake was the missing "[]" in the original test-code.

Publisert 3. apr. 2017 22:29

The slides from Rolf Michelsen's guest lecture today can be found here.

Publisert 26. mars 2017 20:12

On 3/4, you don't have to listen to my monotonous droning voice. Instead, you're in for a treat!

Rolf Michelsen will hold a guest lecture titled Information Retrieval versus The Real World. Rolf is a seasoned industry veteran and has worked with large-scale search and search-related technologies for the last two decades in companies such as Fast Search & Transfer, Microsoft and Cxense.

Be there! :-)

Publisert 26. mars 2017 20:07

Due to a scheduling conflict, I have to cancel the lecture tomorrow 27/3. I've updated the lecture plan accordingly.

Publisert 23. mars 2017 16:13

Last Monday I briefly mentioned that I'd post a link to a good tutorial on how neural networks are used in NLP. Here it is!

Publisert 16. mars 2017 13:11

As has been pointed out, the precode for assigment-d has no way of retreiving number of documents for calculating the idf-value. For a quick fix, simply hard-code the number of documents to be 10000, this should make the tests pass.

The tests assume that the tf-idf weighting uses an ntn variant (see slide 41 in lecture 6), so implementations with other variants will most likely not pass the tests, even though they are correct. 

Publisert 13. mars 2017 16:37

Just so that it's clear: As previously mentioned, some of the slides used in this course were created by the authors of the textbook for a course held at Stanford. Hence, when one of the slides used today mention "a midterm on the 22nd", that is a reference to the Stanford course -- there is no midterm on the 22nd in INF3800/INF4800! :-)

Publisert 5. mars 2017 19:35

I ran out of time last Monday and didn't cover Chapter 8 and metrics for evaluating relevancy. Will play catch-up tomorrow!

Publisert 27. feb. 2017 10:27

Because of low interest in group sessions on Wednesdays, these will no longer be. The group sessions on Thursdays will continue as normal, but now with two teaching assistants. Keep using Piazza for asking and answering each others questions!

Publisert 18. feb. 2017 13:39

Det blir ingen forelesning mandag 20/2 grunnet reisevirksomhet.

Publisert 1. feb. 2017 15:19

We have now created a Piazza for INF3800/4800. This page can be used to ask and answer the questions of fellow students. To sign up, follow this link. Please read through the "Welcome to Piazza!" before posting any questions.

Publisert 1. feb. 2017 12:46

The textbook covers the Porter stemming algorithm in some detail. If you want to study this (and other stemmers) at the source code level, see, e.g., here or here.

Publisert 1. feb. 2017 12:42

The textbook mentions Soundex when it comes to doing phonetic searching and retrieval based on a "soundslike" operator. In Monday's lecture I mentioned the improved Double Metaphone algorithm. You can also take a look at some source code to get a better feel for how the algorithm works.

Publisert 31. jan. 2017 16:30

The "Hardware basics" section in Chapter 4 is somewhat obsolete, e.g., it's ten years old and doesn't mention SSD disks. Here are some more up-to-date numbers, for reference.

Publisert 23. jan. 2017 10:41

Obligkalender med oversikt over publiserings- og innleveringdatoer for obliger er nå ute, og ligger under Ressurser på høyresiden på siden. Første obligatoriske oppgave vil bli publisert i løpet av dagen!