The goal of kaefa is to improve researchers’ ability to identify unexplained factor structures in complex, cross-classified multilevel data in R. It does so through an automated exploratory factor analysis (aefa) framework, which implements a greedy search algorithm to efficiently explore the model space and find improved model configurations. The algorithm iteratively:
- Evaluates multiple model candidates with different factor structures and item response models
- Selects the best model based on information criteria (DIC, AIC, BIC, etc.)
- Assesses item fit and removes poorly fitting items one at a time
- Re-estimates the model until convergence to a locally optimal solution
This greedy approach trades an exhaustive search for speed: it refines one model at a time and stops when no further improvement is found. The method aligns with model selection and exploratory factor analysis research (Preacher, Zhang, Kim, & Mels, 2013; Jennrich & Bentler, 2011).
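The loop above can be sketched in R. This is only an illustration of the search strategy, not kaefa internals: the helpers `fitCandidates()`, `bestByIC()`, and `worstMisfittingItem()` are hypothetical names, not part of the kaefa API.

``` r
# A minimal sketch of the greedy model search, assuming hypothetical helpers
# fitCandidates(), bestByIC(), and worstMisfittingItem() (not kaefa API).
greedySearch <- function(data, maxFactors = 5) {
  items <- colnames(data)
  best <- NULL
  repeat {
    # 1. Evaluate candidates with different factor counts / response models
    candidates <- fitCandidates(data[, items], nFactors = seq_len(maxFactors))

    # 2. Select the best candidate by an information criterion (e.g. BIC)
    current <- bestByIC(candidates, criterion = "BIC")

    # 3. Remove at most one poorly fitting item, then re-estimate
    bad <- worstMisfittingItem(current)
    if (is.null(bad)) {
      best <- current  # no misfitting items left: locally optimal solution
      break
    }
    items <- setdiff(items, bad)
  }
  best
}
```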
References:
- Preacher, K. J., Zhang, G., Kim, C., & Mels, G. (2013). Choosing the optimal number of factors in exploratory factor analysis: A model selection perspective. Multivariate Behavioral Research, 48(1), 28-56. https://doi.org/10.1080/00273171.2012.710386
- Jennrich, R. I., & Bentler, P. M. (2011). Exploratory bi-factor analysis. Psychometrika, 76(4), 537-549. https://doi.org/10.1007/s11336-011-9218-4
You can install kaefa from GitHub with:

``` r
# install.packages("devtools")
devtools::install_github("seonghobae/kaefa")
```

This is a basic example which shows you how to solve a common problem:
``` r
## basic example code
library('kaefa')
mod1 <- kaefa::aefa(mirt::Science)
mod1
```

You can preconfigure remote hosts and SSH keys for `aefaInit()`:
``` r
options(kaefaServers = c("node1", "node2"))
ssh_keys <- c(
  normalizePath("~/.ssh/kaefa_node1"),
  normalizePath("~/.ssh/kaefa_node2")
)
init <- aefaInit(sshKeyPath = ssh_keys)
```

Security checklist:
- Use absolute paths (expand `~` with `normalizePath()`).
- Restrict key permissions (for example, `chmod 600 ~/.ssh/kaefa_node1`).
- Store keys in encrypted storage or a secrets manager; never commit them.
- Rotate keys regularly (for example, quarterly) and limit access to required users or groups.
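As a quick sanity check before calling `aefaInit()`, key permissions can also be verified from R with base functions. A sketch, using the example key path from above:

``` r
# Check (and, if necessary, tighten) SSH key file permissions from R.
key <- normalizePath("~/.ssh/kaefa_node1", mustWork = FALSE)
if (file.exists(key)) {
  mode <- as.character(file.mode(key))  # e.g. "600" or "644"
  if (mode != "600") {
    warning("Key ", key, " has mode ", mode, "; tightening to 600.")
    Sys.chmod(key, mode = "0600")
  }
}
```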
Use the following rule-of-thumb thresholds when deciding whether to run `aefa()` on a local workstation or a remote server.
| Workload profile | Suggested runtime | Recommended environment |
|---|---|---|
| Up to ~5,000 respondents and up to ~60 items | Usually under 30 minutes | Local machine (8+ CPU threads, 16GB+ RAM) |
| ~5,000-20,000 respondents or ~60-150 items | About 30-120 minutes | Remote VM/cluster node (16+ CPU threads, 32GB+ RAM) |
| Over ~20,000 respondents or over ~150 items | Often over 2 hours | Remote cluster/HPC (32+ CPU threads, 64GB+ RAM) |
These are empirical guidelines. Runtime and memory can vary by hardware,
aefa() options (for example, rotation/estimation choices), and
parallel job count.
Operational notes:
- Prefer local runs for exploratory tuning and small pilot datasets.
- Prefer remote runs when model-search cycles are long, memory usage spikes, or multiple analyses must run in parallel.
- If you see repeated slow convergence, monitor RAM/CPU and move the workload to remote infrastructure before increasing model complexity.
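The table's cutoffs can be encoded as a small helper for scripted workflows. A sketch only: `suggestEnvironment()` is a hypothetical convenience function using the approximate figures above, not part of the kaefa API.

``` r
# Suggest an execution environment from data dimensions, using the
# approximate cutoffs in the table above (hypothetical helper, not kaefa API).
suggestEnvironment <- function(data) {
  n <- nrow(data)
  p <- ncol(data)
  if (n > 20000 || p > 150) {
    "Remote cluster/HPC (32+ CPU threads, 64GB+ RAM)"
  } else if (n > 5000 || p > 60) {
    "Remote VM/cluster node (16+ CPU threads, 32GB+ RAM)"
  } else {
    "Local machine (8+ CPU threads, 16GB+ RAM)"
  }
}

# Example: suggestEnvironment(myItemResponses)
```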
For applied psychologists who prefer a point-and-click interface without writing code, kaefa now includes an interactive Shiny web application:
``` r
# Launch the interactive interface
library('kaefa')
launchAEFA()
```

The Shiny interface provides:
- Easy data upload: Upload your item response data in CSV or RDS format
- Simple configuration: Configure factor extraction, rotation methods, and model selection criteria through dropdown menus
- Visual results: View factor loadings, item fit statistics, and model fit indices in an organized interface
- Export results: Download complete results and summary reports
This makes kaefa accessible to researchers without programming experience while maintaining all the powerful automated factor analysis capabilities.
kaefa now supports setting theta priors based on empirical raw score
distributions using the fitdistrplus package. This feature allows you
to:
- Fit distributions to raw scores to inform theta priors
- Test if calibration works for non-nominal models
- Validate model calibration against empirical distributions
Example usage:
``` r
# Fit a distribution to raw scores
fit <- fitThetaPrior(mirt::Science, dist = "norm")

# Test calibration with distribution fit
testResult <- testThetaPriorCalibration(mirt::Science, dist = "norm")

# Apply theta prior during calibration
model <- applyThetaPrior(mirt::Science, fit, minExtraction = 1, maxExtraction = 1)
```

For more examples and detailed documentation, see the examples/ directory.