Deakin University

Language-guided Visual Reasoning via Deep Neural Networks

thesis
Posted on 2025-10-27, 02:50. Authored by Long Hoang Dang.
This thesis introduces deep neural network architectures for vision-and-language reasoning tasks, together with a prompting technique for querying Large Vision-Language Models. The focus is on moving beyond basic one-step classification toward dynamic, query-specific computational models. These models support iterative reasoning, visual-language understanding, and the connection of abstract linguistic concepts with real visual elements.

Open access

  • Yes

Language

eng

Copyright notice

All rights reserved

Editor/Contributor(s)

Truyen Tran, Vuong Le, Thao Le

Pagination

164 p.

Degree type

Doctorate

Degree name

Ph.D.
