Added shallow search for data.table in tables() #7580

manmita · 2026-01-09T23:08:26Z

added arg depth = 1L to tables() one for shallow search
if depth is 0 then its the data.table
if depth is 1, we loop through list-like objects using is.list and which are not data.table or data.frame
if depth > 1, we throw error

added name for the nested list found parent[[1]] or parent$child
pre-allocating info to avoid reallocation cost

…o feat/adding_list_search_to_tables

manmita · 2026-01-09T23:23:27Z

Hello,

I created a new PR in replacement of #7568

Reasons: There was some git issue there and the merge became too complex and I changed the algo because I didnt know previously that rbind or cbind would cost for re-allocation

The current PR considers that part and avoids appends

Previous PR : creating seperate data.table called info and rbind at the end
This PR: pre-allocates for a total-sized data.table and fills the info

manmita · 2026-01-09T23:26:45Z

In reply to previous comment of @jangorecki

An example of when this new feature could be useful?

To support lists which occur due to split.data.table or fread like the following

list(data.table(a = 1, b = 4:6)),
      data.table(a = 2, b = 7:10))

The original code supported data.table() top level and this code adds support for list(data.table) if the arg shallow_search = TRUE

manmita · 2026-01-09T23:36:23Z

Example of the original code and the new feature is as follows

> A = list(data.table(a = 1, b = 4:6),
      data.table(a = 2, b = 7:10))
> B = list(data.table(a = 1, b = 4:6), 1:5)
> C = data.table(a = 1, b = 4:6)
> tables()
   NAME NROW NCOL MB COLS    KEY
1:    C    3    2  0  a,b [NULL]
Total: 0MB using type_size
> tables(shallow_search = TRUE)
     NAME NROW NCOL MB COLS    KEY
1: A[[1]]    3    2  0  a,b [NULL]
2: A[[2]]    4    2  0  a,b [NULL]
3: B[[1]]    3    2  0  a,b [NULL]
4:      C    3    2  0  a,b [NULL]
Total: 0MB using type_size
> D = list(d = data.table(a = 1, b = 4:6), x = 1:5)
> tables(shallow_search = TRUE)
     NAME NROW NCOL MB COLS    KEY
1: A[[1]]    3    2  0  a,b [NULL]
2: A[[2]]    4    2  0  a,b [NULL]
3: B[[1]]    3    2  0  a,b [NULL]
4:      C    3    2  0  a,b [NULL]
5:    D$d    3    2  0  a,b [NULL]
Total: 0MB using type_size

tables() work same as before and tables(shallow_search = TRUE) searches 1 level

codecov · 2026-01-09T23:37:12Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.03%. Comparing base (1bd88cb) to head (c65ff92).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #7580   +/-   ##
=======================================
  Coverage   99.02%   99.03%           
=======================================
  Files          87       87           
  Lines       16896    16937   +41     
=======================================
+ Hits        16732    16773   +41     
  Misses        164      164

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2026-01-09T23:38:25Z

No obvious timing issues in HEAD=feat/adding_list_search_to_tables

Generated via commit c65ff92

Download link for the artifact containing the test results: ↓ atime-results.zip

Task	Duration
R setup and installing dependencies	3 minutes and 1 seconds
Installing different package versions	22 seconds
Running and plotting the test cases	4 minutes and 2 seconds

MichaelChirico · 2026-02-06T17:42:51Z

inst/tests/tests.Rraw

+xenv2$N = list(a = 1:5)
+setkey(xenv2$M$b, a)
+setindex(xenv2$M$b, b)
+test(2360.1, tables(env = xenv2, shallow_search = TRUE)$NAME, c("DT", "L[[1]]", "L[[2]]", "M$b"))


prefer saving the output to re-running tables(env = xenv2, shallow_search = TRUE) many times.

alternatively, just do one test like

test(2360.1, tables(env = xenv2, shallow_search = TRUE)[, .(NAME, NROW, NCOL)], data.table(...))

making it one test

MichaelChirico · 2026-02-06T17:43:18Z

inst/tests/tests.Rraw

+xenv2$M = list(b = data.table(a = 1, b = 4:6), a = 1:5)
+xenv2$N = list(a = 1:5)
+setkey(xenv2$M$b, a)
+setindex(xenv2$M$b, b)


nit: I would move setindex() closer to the index=TRUE tests

MichaelChirico · 2026-02-06T17:44:14Z

R/tables.R

+    }
+  }
+  else {
+    # the original code path when shallow_search=FALSE


this comment doesn't make sense outside the context of this PR, "the original code path" will be a relic within a few months

MichaelChirico · 2026-02-06T17:46:56Z

R/tables.R

 tables = function(mb=type_size, order.col="NAME", width=80L,
-                  env=parent.frame(), silent=FALSE, index=FALSE)
+                  env=parent.frame(), silent=FALSE, index=FALSE,
+                  shallow_search=FALSE)


I am thinking the best way to go about this is actually something like either depth=0 or recursive=FALSE. Then both cases share the same logic, except that the default cuts out after the shallow search.

If the code to do a recursive walk is proving intimidating, we can do depth=0, and this PR can support depth=1 and error for depth>1 "not yet supported" and leave it for future work.

WDYT?

yeah we can do a depth = 0, 1 and error at depth >1 for this PR

…o feat/adding_list_search_to_tables

MichaelChirico · 2026-02-07T08:37:49Z

R/tables.R

+  }
+  else {
+    # for depth greater than 1L,recursion is not implemented yet
+    stop("depth > 1L is not implemented yet", call. = FALSE)


ah, sorry, the "correct" way to make that lint go away is to use stopf() instead

MichaelChirico · 2026-02-07T08:38:54Z

R/tables.R

+    list_index = which(is_list & !is_dt & !is_df)
+    # obj_list is a list of lists of data.tables found inside lists
+    obj_list = vector("list", length(list_index))
+    #make a listof size list_index and add wl in it


Suggested change

#make a listof size list_index and add wl in it

# make a list of size list_index and add wl in it

feat(2606): added shallow search for data.table in tables()

803f763

manmita requested a review from MichaelChirico as a code owner January 9, 2026 23:08

manmita added 2 commits January 10, 2026 04:47

Merge branch 'master' of https://github.com/Rdatatable/data.table int…

b9fdca8

…o feat/adding_list_search_to_tables

feat(2606): re-numbered tests

107a65b

feat(2606): fixed 2360.2 test

ca8701e

manmita added 5 commits January 10, 2026 05:38

feat(2606): added shallow_search arg into tables.Rd file

174b0ea

feat(2606): added tests for mb = TRUE and empty env

03e58c9

feat(2606): add more tests to fix coverage issue

3ba7c1b

feat(2606): fix tests 2360 for type

1e1fd11

feat(2606): fix tests 2360

1952290

MichaelChirico reviewed Feb 6, 2026

View reviewed changes

manmita added 5 commits February 7, 2026 04:54

Merge branch 'master' of https://github.com/Rdatatable/data.table int…

40a188d

…o feat/adding_list_search_to_tables

feat(2606): update tests and news

0f70dc9

feat(2606): update tests and comment

2f19256

feat(2606): add the dt to test 2366.1

afd197a

feat(2606): updated comments and doc from shallow search to depth arg

b591887

manmita requested a review from MichaelChirico February 6, 2026 23:51

feat(2606): add call. to stop

866820a

MichaelChirico reviewed Feb 7, 2026

View reviewed changes

MichaelChirico and others added 2 commits February 7, 2026 00:40

else placement style

2880f1e

feat(2606): use stopf and comment change

c65ff92

manmita requested a review from MichaelChirico February 7, 2026 17:25

	#make a listof size list_index and add wl in it
	# make a list of size list_index and add wl in it

Added shallow search for data.table in tables() #7580

Are you sure you want to change the base?

Added shallow search for data.table in tables() #7580

Conversation

manmita commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

manmita commented Jan 9, 2026

Uh oh!

manmita commented Jan 9, 2026

Uh oh!

manmita commented Jan 9, 2026

Uh oh!

codecov bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

manmita commented Jan 9, 2026 •

edited

Loading

codecov bot commented Jan 9, 2026 •

edited

Loading

github-actions bot commented Jan 9, 2026 •

edited

Loading