Package 'ura' reference manual

Title:	Monitoring Rater Reliability
Description:	Provides researchers with a simple set of diagnostic tools for monitoring the progress and reliability of raters conducting content coding tasks. Goehring (2024) <https://bengoehring.github.io/improving-content-analysis-tools-for-working-with-undergraduate-research-assistants.pdf> argues that supervisors---especially supervisors of small teams---should utilize computational tools to monitor reliability in real time. As such, this package provides easy-to-use functions for calculating inter-rater reliability statistics and measuring the reliability of one coder compared to the rest of the team.
Authors:	Benjamin Goehring [aut, cre]
Maintainer:	Benjamin Goehring <[email protected]>
License:	MIT + file LICENSE
Version:	1.0.0
Built:	2025-02-06 04:56:30 UTC
Source:	https://github.com/bengoehring/ura

Anxiety ratings

Description

Simulated data from three raters rating the anxiety of 20 individuals. The codings range from 1 (no anxiety) to 6 (extremely anxious). The data are forked directly from the irr package, with the only difference being the shape of the dataset.

Usage

anxiety
anxiety

Format

## 'anxiety' A data frame with 60 rows and 3 columns:

subject_id: The subject being screened for anxiety (numeric).
rater_id: The rater evaluating the subject for anxiety (numeric).
anxiety_level: The level of anxiety observed in the subject by the rater (numeric).

Source

<https://cran.r-project.org/package=irr>

Psychiatric diagnoses of patients

Description

Data from Fleiss (1971) concerning the psychiatric conditions of thirty patients as evaluated by six raters. The data are forked directly from the irr package, with the only difference being the shape of the dataset.

Usage

diagnoses
diagnoses

Format

## 'diagnoses' A data frame with 180 rows and 3 columns:

patient_id: The patient being screened for a psychiatric condition (numeric).
rater_id: The rater evaluating the patient for a psychiatric condition (numeric).
diagnosis: The psychiatric diagnosis of the patient (factor).

Source

Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378-382.

References

Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76, 378-382.

int_return_dbl_coded

Description

int_return_dbl_coded An internal function to return the subjects double-coded by the raters. It runs a number of checks along the way

Usage

int_return_dbl_coded(
  in_object_name,
  in_rater_column,
  in_subject_column,
  in_coding_column
)
int_return_dbl_coded(
  in_object_name,
  in_rater_column,
  in_subject_column,
  in_coding_column
)

Arguments

`in_object_name`	A dataframe or tibble containing raters' codings. Each row should contain the assigned coding from a given rater-subject.
`in_rater_column`	The name of the column containing the raters' names as a string.
`in_subject_column`	The name of the column containing the names of the subjects being coded as a string.
`in_coding_column`	The name of the column containing the codings assigned by the raters as a string.

Author(s)

Benjamin Goehring <[email protected]>

irr_stats

Description

irr_stats calculates a variety of IRR statistics.

Usage

irr_stats(
  object_name,
  rater_column,
  subject_column,
  coding_column,
  round_digits = 2,
  stats_to_include = c("Percentage agreement", "Krippendorf's Alpha")
)
irr_stats(
  object_name,
  rater_column,
  subject_column,
  coding_column,
  round_digits = 2,
  stats_to_include = c("Percentage agreement", "Krippendorf's Alpha")
)

Arguments

`object_name`	A dataframe or tibble containing raters' codings. Each row should contain the assigned coding from a given rater-subject.
`rater_column`	The name of the column containing the raters' names as a string.
`subject_column`	The name of the column containing the names of the subjects being coded as a string.
`coding_column`	The name of the column containing the codings assigned by the raters as a string.
`round_digits`	The number of decimals to round the IRR values by. The default is 2.
`stats_to_include`	The IRR statistics to include in the output. Currently only supports percent agreement and Krippendorf's Alpha. See the documentation of the irr package for more information about specific IRR statistics.

Value

A tibble containing the IRR statistic, the statistic's value, and the number of subjects used to calculate the statistic.

Author(s)

Benjamin Goehring <[email protected]>

Examples

# Return IRR statistics for the diagnoses dataset:
irr_stats(diagnoses,
          rater_column = 'rater_id',
          subject_column = 'patient_id',
          coding_column = 'diagnosis')

# And IRR statistics for the anxiety dataset:
irr_stats(anxiety,
          rater_column = 'rater_id',
          subject_column = 'subject_id',
          coding_column = 'anxiety_level')

# Return IRR statistics for the diagnoses dataset:
irr_stats(diagnoses,
          rater_column = 'rater_id',
          subject_column = 'patient_id',
          coding_column = 'diagnosis')

# And IRR statistics for the anxiety dataset:
irr_stats(anxiety,
          rater_column = 'rater_id',
          subject_column = 'subject_id',
          coding_column = 'anxiety_level')

rater_agreement

Description

rater_agreement calculates the percent agreement between each rater and the other raters who coded the same subjects.

Usage

rater_agreement(object_name, rater_column, subject_column, coding_column)
rater_agreement(object_name, rater_column, subject_column, coding_column)

Arguments

`object_name`	A dataframe or tibble containing raters' codings. Each row should contain the assigned coding from a given rater-subject.
`rater_column`	The name of the column containing the raters' names as a string.
`subject_column`	The name of the column containing the names of the subjects being coded as a string.
`coding_column`	The name of the column containing the codings assigned by the raters as a string.

Value

A tibble where each row notes the percent agreement between rater i and all other raters who coded the same subjects (percent_agree). The n_multi_coded column notes how many subjects have been coded by rater i that have also been coded by other raters (i.e., the denominator for the percent_agree value).

Author(s)

Benjamin Goehring <[email protected]>

Examples

# Example data: 3 raters assigning binary values to 10 subjects
example_data <- tibble::tribble(
  ~rater,	~subject,	~coding,
  1,	1,	1,
  1,	2,	0,
  1,	3,	1,
  1,	4,	0,
  2,	3,	1,
  2,	9,	0,
  2,	10,	1,
  2,	4,	1,
  2,	5,	1,
  2,	6,	1,
  3,	5,	1,
  3,	6,	1,
  3,	7,	1,
  3,	8,	1,
)

# Find percent agreement by rater
rater_agreement(example_data,
                rater_column = 'rater',
                subject_column = 'subject',
                coding_column = 'coding')


# Example data: 3 raters assigning binary values to 10 subjects
example_data <- tibble::tribble(
  ~rater,	~subject,	~coding,
  1,	1,	1,
  1,	2,	0,
  1,	3,	1,
  1,	4,	0,
  2,	3,	1,
  2,	9,	0,
  2,	10,	1,
  2,	4,	1,
  2,	5,	1,
  2,	6,	1,
  3,	5,	1,
  3,	6,	1,
  3,	7,	1,
  3,	8,	1,
)

# Find percent agreement by rater
rater_agreement(example_data,
                rater_column = 'rater',
                subject_column = 'subject',
                coding_column = 'coding')

Package 'ura'

Help Index

Anxiety ratings

Description

Usage

Format

Source

Psychiatric diagnoses of patients

Description

Usage

Format

Source

References

int_return_dbl_coded

Description

Usage

Arguments

Author(s)

irr_stats

Description

Usage

Arguments

Value

Author(s)

Examples

rater_agreement

Description

Usage

Arguments

Value

Author(s)

Examples