运行多行的R代码,每次都有一个变量以提高可读性
我希望通过查看是否有一种“循环”或“重新运行”的代码行的方法来提高代码的可读性,而这些代码线非常相似,但每次都有一个变量不同。
我的实际数据分析涉及从blme
软件包运行许多blmer
调用。我的每个分析都有一个因变量,一个自变量(其中有很多),“波浪”变量(由于在3个时间点上收集了数据)和独特的参与者ID作为随机效果。
我正在尝试构建许多模型,所有模型都非常相似,但是每个模型都在输入的变量上不同。
在以下代码中,我概述了更多详细信息,构建了一个新的,虚拟的数据文件,并试图重新创建与实际文件中类似的模型。
该代码在我的真实数据和虚拟数据中没有问题。我想提请注意这里的关注是,即使只有3种型号(如下文我的示例中),该代码开始变得漫长而重复。
##test script##
library(dplyr)
library(tidyverse)
library(blme)
#packages loaded - I'm not sure these three are exactly needed, I just loaded
#dplyr and tidyverse incase...but blme is for the Bayesian models coming later
#everything below worked on RStudio on my end but, I like I say, I don't
#know if that is because of the above packages or not...
##build a file
DV0 <- c(100, 50, 75, 80, 20, 30) #let's say performance on a soccer task at time 1 - max 100
DV1 <- c(100, 60, 80, 80, 25, 40) #performance on soccer task at time 2
DV2 <- c(95, 55, 70, 70, 20, 35) #performance on soccer task at time 3
IV1.0 <- c(90, 60, 65, 75, 40, 50) #score on cognitive task A at time 1 - max 100
IV1.1 <- c(95, 70, 75, 80, 50, 70) #score on cog task A at time 2
IV1.2 <- c(90, 55, 60, 70, 45, 60) #score on cog task A at time 3
IV2.0 <- c(10, 40, 50, 60, 20, 25) #score on cognitive task B at time 1 - max 100
IV2.1 <- c(20, 50, 60, 75, 35, 35) #score on cog task B at time 2
IV2.2 <- c(15, 40, 40, 55, 25, 25) #score on cos task B at time 3
id <- c("Jon", "Sara", "Lisa", "Tim", "Joe", "Paul")
##create a data frame before pivot to a better format for longitudinal data
df <- data.frame(DV0, DV1, DV2, IV1.0, IV1.1, IV1.2, IV2.0, IV2.1, IV2.2,
id)
df.long <- long_panel(df, begin = 0, end = 2, label_location = "end")
#now onto the main analyses
#here I want to use "blmer" from "blme" package to understand how performance
#on the soccer task first is affected by time alone (model1 below).
#Next,I want to check whether adding performance on cognitive task A
#influences performance (model2 below), before running the same analyses but with
#cognitive task B (model3 below) - in this example I have just two cognitive
#tasks, but in my real work I have many more IVs to test (let's in this case
#just say it would be more cognitive tasks). Final thing I plan to add an
#individual slope and intercept based on the id variable
#time alone and soccer task performance
model1 <- blmer(DV ~ wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model1)
#new experimental model with cognitive tasks A performance added
model2 <- blmer(DV ~ IV1. + wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model2)
anova(model1, model2)
#a similar experimental model with cogntive tasks B performance instead of A
model3 <- blmer(DV ~ IV2. + wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model3)
anova(model1, model3)
#in the real data I then have many more models with IV1. or IV2. changed for
#another independent variable (e.g., IV3. or IV4.) and as a result the code
#is very long. I'm wanting to know, can the above be put together in fewer
#lines of code. What I've been reading is maybe that I could loop somewhere
#so that "IV.*" is replaced each time?
#thanks in advance for any help!
因此,如果您有任何方法可以在此示例中基本运行Model1,Model2和Model3的代码3,如果代码较少行,那就太好了。
I am looking to improve the readability of my code by seeing if there is a way to "loop" or "re-run" lines of code that are very similar but differ by a single variable each time.
My actual data analyses involves running a number of blmer
calls from the blme
package. Each of my analyses has a dependent variable, an independent variable (of which there are many), a "wave" variable (as data was collected over 3 timepoints), and unique participant id as a random effect.
I'm trying to build a number of models, all of which are very similar, but each differs on what is entered as the independent variable.
In the below code, I have outlined some more details, built a new, fictitious, data file, and tried to recreate models similar to those in my actual file.
The code runs without problem on my real data and here in the fictitious data. What I'd like to draw attention to here is how even with just 3 models included (as is the case in my example below) the code begins to become long and repetitive.
##test script##
library(dplyr)
library(tidyverse)
library(blme)
#packages loaded - I'm not sure these three are exactly needed, I just loaded
#dplyr and tidyverse incase...but blme is for the Bayesian models coming later
#everything below worked on RStudio on my end but, I like I say, I don't
#know if that is because of the above packages or not...
##build a file
DV0 <- c(100, 50, 75, 80, 20, 30) #let's say performance on a soccer task at time 1 - max 100
DV1 <- c(100, 60, 80, 80, 25, 40) #performance on soccer task at time 2
DV2 <- c(95, 55, 70, 70, 20, 35) #performance on soccer task at time 3
IV1.0 <- c(90, 60, 65, 75, 40, 50) #score on cognitive task A at time 1 - max 100
IV1.1 <- c(95, 70, 75, 80, 50, 70) #score on cog task A at time 2
IV1.2 <- c(90, 55, 60, 70, 45, 60) #score on cog task A at time 3
IV2.0 <- c(10, 40, 50, 60, 20, 25) #score on cognitive task B at time 1 - max 100
IV2.1 <- c(20, 50, 60, 75, 35, 35) #score on cog task B at time 2
IV2.2 <- c(15, 40, 40, 55, 25, 25) #score on cos task B at time 3
id <- c("Jon", "Sara", "Lisa", "Tim", "Joe", "Paul")
##create a data frame before pivot to a better format for longitudinal data
df <- data.frame(DV0, DV1, DV2, IV1.0, IV1.1, IV1.2, IV2.0, IV2.1, IV2.2,
id)
df.long <- long_panel(df, begin = 0, end = 2, label_location = "end")
#now onto the main analyses
#here I want to use "blmer" from "blme" package to understand how performance
#on the soccer task first is affected by time alone (model1 below).
#Next,I want to check whether adding performance on cognitive task A
#influences performance (model2 below), before running the same analyses but with
#cognitive task B (model3 below) - in this example I have just two cognitive
#tasks, but in my real work I have many more IVs to test (let's in this case
#just say it would be more cognitive tasks). Final thing I plan to add an
#individual slope and intercept based on the id variable
#time alone and soccer task performance
model1 <- blmer(DV ~ wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model1)
#new experimental model with cognitive tasks A performance added
model2 <- blmer(DV ~ IV1. + wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model2)
anova(model1, model2)
#a similar experimental model with cogntive tasks B performance instead of A
model3 <- blmer(DV ~ IV2. + wave + (1 | id), data = df.long, REML = FALSE,
fixef.prior = normal)
summary(model3)
anova(model1, model3)
#in the real data I then have many more models with IV1. or IV2. changed for
#another independent variable (e.g., IV3. or IV4.) and as a result the code
#is very long. I'm wanting to know, can the above be put together in fewer
#lines of code. What I've been reading is maybe that I could loop somewhere
#so that "IV.*" is replaced each time?
#thanks in advance for any help!
So, if you have any ways to essentially run the code for model1, model2, and model3 in this example if fewer lines of code, that would be great.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以创建一个接收自变量作为字符串的函数,加上DF和其他选项,以及Leverages
as.formula()
。然后使用lapply()
将功能应用于每个自变量。在运行仅波浪模型时,您可以将“”
用作“自变量”(即模型1)。现在,获取一个名为
models
的列表,您可以运行您喜欢的任何ANOVA,如:
输出:
还有另一个选项,它是制作
df.long
,甚至“更长”,然后通过分组变量估算模型。 此操作的示例这是使用
data.table
输出进行You can create a function that receives the independent variable as a string, plus the df, and other options, and leverages
as.formula()
. Then apply the function to each of the your independent variables usinglapply()
. You can use""
as the "independent variable", when running the wave-only model (i.e. model 1).Now get a list called
models
You can run any anova you like, like this:
Output:
There is another option, which is to make
df.long
even "longer", and then estimate the models by the grouping variable. Here is an example of doing that withdata.table
Output is a data.table of models