Skip to content

Commit 70ece01

Browse files
authored
Update README.md
1 parent bc2d7f8 commit 70ece01

File tree

1 file changed

+58
-9
lines changed

1 file changed

+58
-9
lines changed

README.md

Lines changed: 58 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,17 +1,66 @@
1-
# NetworkAnalysis
1+
# CoAuthorship - Social Network Analysis
22

3-
Welcome to the Social Network Analysis (SNA) repository! This project provides tools and code to perform social network analysis, generate insightful reports, and visualize interactions within a network.
3+
Welcome to the **Social Network Analysis (SNA) repository!**
4+
This project focuses on *co-authorship network analysis*, a powerful method for visualising and analysing collaboration patterns among researchers. By representing authors as nodes and their co-authorships as edges, we construct a network that reveals clusters of closely connected researchers, highlights central figures, and identifies critical "cutpoint" authors whose removal would fragment the network.
45

5-
## Data format
6-
Data should be input in .csv format, with one column containing author names.
6+
This repository provides tools and code to perform social network analysis, generate insightful reports, and visualise interactions within a network.
77

8-
It is also possible to use a .net file generated by the VOSviewer software.
98

10-
## Modifying the report
11-
Run `Generate_report.r` to generate the report in markdown and pdf formats (`Report.md` and `Report.pdf`). Note: generating the pdf requires latex and pandoc to be installed on the system.
9+
---
10+
11+
## Instructions for Use
12+
13+
### 1. Data format
14+
The input data should be in `.csv` format, with one column containing author names. This CSV file can be exported from databases such as PubMed, Scopus, Web of Science, or university research databases.
15+
16+
### 2. Clone the repository
17+
Clone the entire repository to your computer using the `git clone` command or the `GitHub Desktop` application.
18+
19+
### 3. Ruuning the repository
20+
21+
#### 3.1 intsall Packages:
22+
Ensure that the following libraries are installed on your computer:
23+
(the list of required packages here.)
24+
25+
#### 3.2 Report generating:
26+
The primary file you will need is `Generate_report.R` . You can run this R script in RStudio or Visual Studio Code. Each function and line of code is well-documented, with instructions and explanations on how to modify it to suit your analysis needs.
27+
The code includes various filter functions that you may adjust depending on your research questions, such as:
28+
29+
- The maximum number of authors per paper in your dataset.
30+
- The minimum number of papers per author to be included in the analysis.
31+
- The timeframe (years) you wish to study.
32+
- Whether to split the analysis into smaller time intervals.
33+
34+
#### 3.3 output folder:
35+
After running the code, the output (including figures, PDFs, and additional CSV files) will be saved in a folder named `output`. This folder will be created automatically if it does not already exist.
36+
If you analyse multiple CSV files, a separate subfolder (under the same name of the csv file) will be created within the output folder for each dataset, containing the relevant results.
37+
38+
##### the generated outcomes:
39+
1. `Report_YYYY_YYYY.pdf`
40+
This is the main output of the automated code: a PDF report containing the analysis results. It includes a table of contents, detailed interpretations of the findings, and visualisations with captions.
41+
The corresponding markdown file, `Report_YYYY_YYYY.md`, is also generated.
42+
43+
2. `centrality_data_YYY-YYYY.csv`
44+
This CSV file contains centrality metrics for all authors in the co-authorship network. It includes four columns:
45+
- Author: Each row represents an author in the network, along with their centrality metrics.
46+
- Degree: The number of direct co-authorship connections an author has in the network.
47+
- Closeness: Indicates how quickly an author can reach others in the network, reflecting their proximity to all other authors.
48+
- Betweenness: Measures the extent to which an author acts as a bridge between other researchers, highlighting their role in connecting different parts of the network.
49+
50+
Use this file to identify influential authors (high degree), those central to information flow (high closeness), and key connectors (high betweenness).
51+
52+
3. `figures` Folder
53+
This folder contains three PNG images:
54+
- graph.png (Figure 1 in the report): Visualisation of the co-authorship network.
55+
- top_authors.png (Figure 2 in the report): Direct connections between the top 15 most central authors.
56+
- cutpoints.png (Figure 3 in the report): Highlighted cutpoint authors in the network.
57+
58+
---
59+
## Repository Contents
60+
61+
62+
---
1263

13-
### Changing text
14-
Report text can be edited in `src/Report_text.r`.
1564

1665
### Adding new metrics
1766
To compute new metrics, edit `src/Interactions_class.r` to add a new method (both functions need to be added):

0 commit comments

Comments
 (0)