课题组每周研讨会
内容:
read.*与 write.*load 与 savereadRDS 与 saveRDShttps://github.com/tidyverse/
数据导入 read_*
管道 %>%
x %>% f(y) > f(x, y)筛选
slice, filter, sample_n, sample_frac, top_n, distinctselect
containsnum_rangestarts_withends_withone_ofmatchesarrange行列增加/更新
mutate, transmutemutate_add_rowadd_columnrenamerownames_to_column, column_to_rowname+ - * / > < ==dplyr:: lag leaddplyr:: cumall cumany cummax cummean cummin cumprod cumsumdplyr:: cume_dist dense_rank min_rank ntile percent_rank row_numberdplyr:: between case_when coalesce if_else na_if pmax pmin recode recode_factor汇总
countsummarizegroup_by, ungroupdplyr:: n n_distinct base::sum(!is.na())mean, meadianmean, sumdplyr:: first last nthquantile min maxIQR mad sd var合并
bind_rowsbind_colssemi_joinanti_joinleft_join, right_join, inner_join, full_joinintersectsetdiffunionsetequal 辅助查看两个数据集是否相同(不管行序)变异动词 (_at, _if, _all)
filter_*select_*summarize_*arrange_*字符处理
substrstringr包与正则表达式略微复杂,可以单独讲一次Tidy 数据格式


tibbletribble, enframeas_tibble, is_tibbledrop_nafillreplace_na长转宽 pivot_wider, spread

宽转长 pivot_longer, gather

expandcompleteseparateseparate_rowsunite数据导出
write_*freadfwritedt[i, j, by]base 与 stringrpurrrstats 与 broomgraphics 与 ggplot2apply家族和purrr等开发: