|
1 | | -# NetworkAnalysis |
| 1 | +# CoAuthorship - Social Network Analysis |
2 | 2 |
|
3 | | -Welcome to the Social Network Analysis (SNA) repository! This project provides tools and code to perform social network analysis, generate insightful reports, and visualize interactions within a network. |
| 3 | +Welcome to the **Social Network Analysis (SNA) repository!** |
| 4 | +This project focuses on *co-authorship network analysis*, a powerful method for visualising and analysing collaboration patterns among researchers. By representing authors as nodes and their co-authorships as edges, we construct a network that reveals clusters of closely connected researchers, highlights central figures, and identifies critical "cutpoint" authors whose removal would fragment the network. |
4 | 5 |
|
5 | | -## Data format |
6 | | -Data should be input in .csv format, with one column containing author names. |
| 6 | +This repository provides tools and code to perform social network analysis, generate insightful reports, and visualise interactions within a network. |
7 | 7 |
|
8 | | -It is also possible to use a .net file generated by the VOSviewer software. |
9 | 8 |
|
10 | | -## Modifying the report |
11 | | -Run `Generate_report.r` to generate the report in markdown and pdf formats (`Report.md` and `Report.pdf`). Note: generating the pdf requires latex and pandoc to be installed on the system. |
| 9 | +--- |
| 10 | + |
| 11 | +## Instructions for Use |
| 12 | + |
| 13 | +### 1. Data format |
| 14 | +The input data should be in `.csv` format, with one column containing author names. This CSV file can be exported from databases such as PubMed, Scopus, Web of Science, or university research databases. |
| 15 | + |
| 16 | +### 2. Clone the repository |
| 17 | +Clone the entire repository to your computer using the `git clone` command or the `GitHub Desktop` application. |
| 18 | + |
| 19 | +### 3. Ruuning the repository |
| 20 | + |
| 21 | +#### 3.1 intsall Packages: |
| 22 | +Ensure that the following libraries are installed on your computer: |
| 23 | +(the list of required packages here.) |
| 24 | + |
| 25 | +#### 3.2 Report generating: |
| 26 | +The primary file you will need is `Generate_report.R` . You can run this R script in RStudio or Visual Studio Code. Each function and line of code is well-documented, with instructions and explanations on how to modify it to suit your analysis needs. |
| 27 | +The code includes various filter functions that you may adjust depending on your research questions, such as: |
| 28 | + |
| 29 | +- The maximum number of authors per paper in your dataset. |
| 30 | +- The minimum number of papers per author to be included in the analysis. |
| 31 | +- The timeframe (years) you wish to study. |
| 32 | +- Whether to split the analysis into smaller time intervals. |
| 33 | + |
| 34 | +#### 3.3 output folder: |
| 35 | +After running the code, the output (including figures, PDFs, and additional CSV files) will be saved in a folder named `output`. This folder will be created automatically if it does not already exist. |
| 36 | +If you analyse multiple CSV files, a separate subfolder (under the same name of the csv file) will be created within the output folder for each dataset, containing the relevant results. |
| 37 | + |
| 38 | +##### the generated outcomes: |
| 39 | + 1. `Report_YYYY_YYYY.pdf` |
| 40 | + This is the main output of the automated code: a PDF report containing the analysis results. It includes a table of contents, detailed interpretations of the findings, and visualisations with captions. |
| 41 | + The corresponding markdown file, `Report_YYYY_YYYY.md`, is also generated. |
| 42 | + |
| 43 | + 2. `centrality_data_YYY-YYYY.csv` |
| 44 | + This CSV file contains centrality metrics for all authors in the co-authorship network. It includes four columns: |
| 45 | + - Author: Each row represents an author in the network, along with their centrality metrics. |
| 46 | + - Degree: The number of direct co-authorship connections an author has in the network. |
| 47 | + - Closeness: Indicates how quickly an author can reach others in the network, reflecting their proximity to all other authors. |
| 48 | + - Betweenness: Measures the extent to which an author acts as a bridge between other researchers, highlighting their role in connecting different parts of the network. |
| 49 | + |
| 50 | + Use this file to identify influential authors (high degree), those central to information flow (high closeness), and key connectors (high betweenness). |
| 51 | + |
| 52 | + 3. `figures` Folder |
| 53 | + This folder contains three PNG images: |
| 54 | + - graph.png (Figure 1 in the report): Visualisation of the co-authorship network. |
| 55 | + - top_authors.png (Figure 2 in the report): Direct connections between the top 15 most central authors. |
| 56 | + - cutpoints.png (Figure 3 in the report): Highlighted cutpoint authors in the network. |
| 57 | + |
| 58 | +--- |
| 59 | +## Repository Contents |
| 60 | + |
| 61 | + |
| 62 | +--- |
12 | 63 |
|
13 | | -### Changing text |
14 | | -Report text can be edited in `src/Report_text.r`. |
15 | 64 |
|
16 | 65 | ### Adding new metrics |
17 | 66 | To compute new metrics, edit `src/Interactions_class.r` to add a new method (both functions need to be added): |
|
0 commit comments