Inclusive Requirements Elicitation

AI in Libraries to Support Respectful Management of Indigenous Knowledges

Samantha Callaghan, Arianna Ciula, Tiffany Ong, Miguel Vieira

Context for Requirements Elicitation

  • iREAL: Indigenous data considerations
  • User research (to be re-integrated)
  • Workshop dashboard constraints
    • Models and data
  • Today's focus:
    • Identify use cases and requirements

Requirements

  • What is a requirement?
  • Purpose: How does it fit project/user needs?
  • Definition: How do we specify them?
  • Prioritisation: How do we rank importance?

MoSCoW Prioritisation

  • Must have
  • Should have
  • Could have
  • Won't have this time
    • Defines out-of-scope items
    • Potential for future phases

MoSCoW Prioritisation

MoSCow prioritisation diagram

System Architecture

Architecture diagram
  1. School records: NSW Aboriginal school records (1876-1979)
  2. Data preparation: Conversion to JSON and Markdown
  3. Data extraction: AI/ML models for metadata extraction
  4. Dashboard: Frontend to explore processed data

Data Preparation

  1. Original documents converted to:

    • JSON (for data extraction)
    • Markdown (for frontend rendering)
  2. Each record divided into smaller, manageable sections

  3. Each section is processed for data extraction

Data Extraction

  1. NER: SpanMarker model
  2. Geocoding: Custom transformer
  3. Keywords: LLM (Mistral 7B) with custom prompt for 5 keywords per text chunk
  4. Topics: LLM with custom prompt for overarching themes

Thank you

samantha.callaghan@kcl.ac.uk