Integrated kmers and adjusted reproducibility by MiriamBalzer · Pull Request #49 · IKIM-Essen/WIN-KID

MiriamBalzer · 2026-02-04T15:02:21Z

No description provided.

into integrated_kmers

…nistic logic for stacked approach -> reproducable results

…h different python versions

Julian-W98

Ich habe einige Unstimmigkeiten bezüglich constanten/variablennutzung gefunden. Auch eine große Code Dublikation in preprocessing.py ist mir aufgefallen.

Um zu überprüfen welcher Teil deines Codes für deterministisches Verhalten sorgt habe ich deine Änderungen einzeln getestet. Tatsächlich scheinen nur die beiden random_state=42 in classic_rf.py und stacked_rf.py den Unterschied zu machen. Zumindest hatte ich so mit den gleichen Daten im Modus TRAIN_TEST auf zwei verschiedenen Nodes das selbe Ergebniss. Alle weiteren Änderungen in stacked_rf.py und utils.py sollten daher aus meiner Sicht nicht vorgenommen werden weil sie nur unnötig die Komplexität erhöhen.

Sonst sind mir nur Kleinigkeiten aufgefallen

workflow/kmers.py

Julian-W98 · 2026-02-11T14:25:26Z

workflow/config.py

 MIN_SAMPLE_NUMBER = 15
 LOGGING_LEVEL = logging.INFO
 KMERE = False
+W2V_MODE = W2VMode.TUNE_W2V


Warum gibt es jetzt den W2VMode und RETRAIN_W2V?

Wäre davon ausgegangen, dass W2VMode.TRAIN_W2V das gleich ist wie RETRAIN_W2V = True

Vielleicht die Benennung hier etwas anpassen um das Unterscheidbar zu machen wenn es wirklich beides braucht

Ja, das ist dem Wachstum der implementation geschuldet. dadurch, dass ich nachträglich die Möglichkeit hinzugefügt habe das trainierte Modell zu speichern und wieder abzurufen. Ich schau mal, ob ich das etwas eindeutiger und weniger repetativ implementiert bekomme :)

Julian-W98 · 2026-02-11T14:26:55Z

workflow/execution_modes.py

    TUNE_HYPERPARAMETER = "tune_hyperparameter"
+
+
+class W2VMode(Enum):


Ist es gewollte das man jetzt W2V auf Train und den Execution Mode auf Predict stellen kann?

Hätte jetzt eher vermutet, dass wenn k-mere auf an gestellt wird und Execution Mode auf training steht der V2W auch mittrainiert wird.

theoretisch kann W2V immer mit laufen, sobald das preprozessing angeschmissen wird, da es ja eben ein vorverarbeitungsschritt ist. Natürlich ist das aber nicht immer sinnvoll 😅 allerdings auch nicht wirklich schlimm. ich schau es mir aber nochmal an

workflow/preprocessing.py

workflow/tune_w2v.py

workflow/stacked_rf.py

workflow/utils.py

…m, normal random forest deleted, changes in utils.py undone, minor changes according to PR comments

MiriamBalzer and others added 11 commits December 15, 2025 18:12

first adjustments for implimenting hyperparameter tuning

2f8b192

Merge branch 'integrated_kmers' of https://github.com/IKIM-Essen/WIN-KID

a359ab4

into integrated_kmers

kmers fully integrated into all preprocessing paths. adjusted determi…

87ed609

…nistic logic for stacked approach -> reproducable results

formatting

a4784e7

corrected config.py conflict

cbcc945

Merge branch 'development' into integrated_kmers

f7be3a3

unittest error fixed

a1098c9

adjusted def save_prediction_results in utils.py to handle pandas wit…

e841b49

…h different python versions

formatting

6d1605c

formatting

bd60787

formatting, again

5dea87b

Julian-W98 requested changes Feb 12, 2026

View reviewed changes

MiriamBalzer added 3 commits March 3, 2026 13:10

w2v tuning now in controller.py, reduced in preprocessing to a minimu…

4b47d9a

…m, normal random forest deleted, changes in utils.py undone, minor changes according to PR comments

adjustments for unitest

a735b64

adjusted git CI

31701d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrated kmers and adjusted reproducibility #49

Integrated kmers and adjusted reproducibility #49
MiriamBalzer wants to merge 14 commits intodevelopmentfrom
integrated_kmers

MiriamBalzer commented Feb 4, 2026

Uh oh!

Julian-W98 left a comment

Uh oh!

Uh oh!

Uh oh!

Julian-W98 Feb 11, 2026

Uh oh!

MiriamBalzer Feb 12, 2026

Uh oh!

Julian-W98 Feb 11, 2026

Uh oh!

MiriamBalzer Feb 12, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		TUNE_HYPERPARAMETER = "tune_hyperparameter"


		class W2VMode(Enum):

Conversation

MiriamBalzer commented Feb 4, 2026

Uh oh!

Julian-W98 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Julian-W98 Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

MiriamBalzer Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

Julian-W98 Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

MiriamBalzer Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants