RMIT University
Browse

Computer-mediated representations: A qualitative examination of algorithmic vision and visual style

Download (3.25 MB)
journal contribution
posted on 2025-10-06, 02:10 authored by Terrell ThomsonTerrell Thomson
To the general public, text-to-image generators, such as Midjourney and DALL-E, seem to work through magic and, indeed, their inner workings are often frustratingly opaque. This is, in part, due to the lack of transparency from big tech companies around aspects like training data and how the algorithms powering their generators work, on one hand, and the deep and technical knowledge in computer science and machine learning, on the other, that is required to understand these workings. Acknowledging these aspects, this qualitative examination seeks to better understand the black box of algorithmic vision through asking a large language model to first describe two sets of visually distinct journalistic images. The resulting descriptions are then fed into the same large language model to see how the AI tool remediates these images. In doing so, this study evaluates how machines process images in each set and which specific visual style elements across three dimensions (representational, aesthetic, and technical) machine vision regards as important for the description and which it does not. Taken together, this exploration helps scholars understand more about how computers process, describe, and render images, including the attributes that they focus on and tend to ignore when doing so.<p></p>

History

Related Materials

  1. 1.
    ISSN - Is published in 1470-3572 (Visual Communication)
  2. 2.
    EISSN - Is published in 1741-3214 (Visual Communication)
  3. 3.

Journal

Visual Communication

Total pages

22

Publisher

SAGE

Language

en

Copyright

© The Author(s) 2025

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC