Web UI #634

hasan7n · 2024-12-21T21:09:43Z

Comments/discussions to check:

To be merged before merging to main: #618, which depends on #615

…d add placeholders + tooltips to forms.

cli/medperf/web_ui/benchmarks/routes.py

cli/medperf/web_ui/containers/routes.py

cli/medperf/web_ui/datasets/routes.py

cli/medperf/web_ui/api/routes.py

+    # List directories inside the path and sort them
+    sorted_folders = []
+    sorted_files = []
+    for item in os.listdir(full_path):


To fix the issue, we will enhance the validation of the path parameter to ensure it does not allow directory traversal attacks. Specifically:

Normalize the path using os.path.normpath to remove any .. segments.

Verify that the normalized full_path starts with BASE_DIR to ensure it is contained within the base directory.

Raise an exception if the validation fails.

This approach ensures that even if a malicious user provides a crafted input, the resulting path will not escape the intended base directory.

cli/medperf/web_ui/api/routes.py

+    sorted_files = []
+    for item in os.listdir(full_path):
+        item_path = os.path.join(full_path, item)
+        if os.path.isdir(item_path):


To fix the issue, we need to ensure that the full_path is normalized before performing the containment check against BASE_DIR. This can be achieved by using os.path.normpath or os.path.realpath on the full_path before comparing it with BASE_DIR. This normalization step will resolve any .. sequences or symbolic links in the path, ensuring that the containment check is robust.

Additionally, we should ensure that the full_path is not only within the BASE_DIR but also a valid directory before proceeding with further operations.

cli/medperf/web_ui/api/routes.py

+    folders = []
+    for item in sorted_items:
+        item_path = os.path.join(full_path, item)
+        if os.path.isdir(item_path):


To fix the issue, we need to ensure that the path parameter is properly sanitized and validated before being used to construct file paths. This involves normalizing the path using os.path.normpath and ensuring that the resulting path is strictly within the BASE_DIR. Additionally, we should validate item_path to ensure it does not point outside the allowed directory structure.

Steps to fix:

Normalize path using os.path.normpath before constructing full_path.

Revalidate that the normalized full_path starts with BASE_DIR after normalization.

Ensure that item_path is also validated to prevent any potential misuse.

cli/medperf/web_ui/security_check.py

+    # Check if user is already authenticated
+    if token == security_token:
+        # User is already authenticated, redirect to original URL
+        return RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)


cli/medperf/web_ui/security_check.py

+    redirect_url: str = Form("/"),
+):
+    if token == security_token:
+        response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)


To fix the issue, we need to validate the redirect_url parameter before using it in the RedirectResponse. A safe approach is to ensure that the redirect_url is either a relative URL or matches a predefined list of allowed URLs. This can be achieved using Python's urlparse module to check that the redirect_url does not include an explicit host name or scheme, ensuring it is a relative path. If the validation fails, the application should redirect to a default safe URL (e.g., the home page).

The changes will be made in the access_web_ui function where the redirect_url is used. We will also add a utility function to perform the validation.

cli/medperf/web_ui/security_check.py

+):
+    if token == security_token:
+        response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)
+        response.set_cookie(key=AUTH_COOKIE_NAME, value=token)


To fix the issue, the set_cookie method should explicitly set the secure, httponly, and samesite attributes. Specifically:

secure=True ensures the cookie is only sent over HTTPS.

httponly=True prevents JavaScript from accessing the cookie.

samesite='Lax' or samesite='Strict' mitigates CSRF risks by restricting cross-origin requests.

The fix involves modifying the set_cookie call on line 35 to include these attributes.

cli/medperf/web_ui/security_check.py

+):
+    if token == security_token:
+        response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)
+        response.set_cookie(key=AUTH_COOKIE_NAME, value=token)


To fix the issue, we need to ensure that the token value is sanitized or validated before being used to construct the cookie. A secure approach would involve:

Validating the token against a predefined set of acceptable values or formats.

Ensuring that the token does not contain any malicious or unexpected content.

Optionally, encoding the token to prevent injection attacks.

In this case, we will validate the token by ensuring it matches the security_token and then use a secure, predefined value (e.g., security_token) to set the cookie instead of the raw user input.

cli/medperf/commands/dataset/import_dataset.py


        # raw_data_path should be provided if the imported dataset is in dev
        if self.dataset.state == "DEVELOPMENT" and (
-            self.raw_data_path is None or os.path.exists(self.raw_data_path)
+            self.raw_data_path is None
+            or os.path.isfile(self.raw_data_path)


To fix the issue, we need to validate the raw_data_path to ensure it is within a predefined safe root directory. This can be achieved by:

Normalizing the raw_data_path using os.path.realpath or Path.resolve to remove any .. segments or symbolic links.

Verifying that the normalized path starts with a predefined safe root directory (e.g., a directory dedicated to storing raw data).

Raising an exception if the validation fails.

The validation should be added in the validate_input method of the ImportDataset class, as this is where the input parameters are initially checked.

cli/medperf/commands/dataset/import_dataset.py

-            self.raw_data_path is None or os.path.exists(self.raw_data_path)
+            self.raw_data_path is None
+            or os.path.isfile(self.raw_data_path)
+            or (os.path.exists(self.raw_data_path) and os.listdir(self.raw_data_path))


To fix the issue, we need to validate the raw_data_path to ensure it is within a safe root directory and does not allow directory traversal. This can be achieved by:

Defining a safe root directory for raw_data_path.

Normalizing the user-provided path using os.path.normpath or Path.resolve to eliminate any .. segments.

Verifying that the normalized path starts with the safe root directory.

The changes will be made in the validate_input method of the ImportDataset class in cli/medperf/commands/dataset/import_dataset.py. Additionally, we will update the import_dataset function in cli/medperf/web_ui/datasets/routes.py to define a safe root directory for raw_data_path.

cli/medperf/commands/dataset/import_dataset.py

-            self.raw_data_path is None or os.path.exists(self.raw_data_path)
+            self.raw_data_path is None
+            or os.path.isfile(self.raw_data_path)
+            or (os.path.exists(self.raw_data_path) and os.listdir(self.raw_data_path))


To fix the issue, we need to validate the raw_data_path to ensure it is safe to use. This involves:

Normalizing the path using os.path.normpath or Path.resolve to remove any .. segments or symbolic links.

Ensuring the normalized path is contained within a predefined safe root directory (e.g., a specific directory for raw data).

Raising an exception if the path is invalid or outside the allowed directory.

The changes will be made in the validate_input method of the ImportDataset class in cli/medperf/commands/dataset/import_dataset.py.

mhmdk0 · 2025-05-28T18:39:30Z

Notes:
Should we do some modifications like:

Change notifications to job lists (running, finished, etc..)
Changes in tooltips and placeholders to match medperf --help (Place the text in separate file).
Refactor CSS as needed
model comp. test running multiple times causes an error (found existing predictions)
Notifications/Finished tasks logs (last 10 maybe) to be saved in database

VukW added 30 commits July 11, 2024 10:34

added fastapi as web ui backend

040d446

Added cube + benchmark basic listing

97e6bd7

Adds navigation

0382684

Aded mlcube detailed page

55fe60e

Improved mlcubes detailed layout

fb1bca3

Improved mlcube layout

64cf53e

yaml displaying

36611e1

yaml: spinner

56fa5c4

yaml panel improvement

8563887

yaml panel layout improvement

07ce4ab

layout fixes

b260401

Added benchmark detailed page

b7980a8

added links to mlcube

ca356cc

benchmark page: added owner

6efd724

Colors refactoring

319b1bf

Dataset detailed page

58008f3

Forgot to add js file

375d89e

Unified data format for all data fields automatically

c6d8a56

(mlcube-detailed) Display image tarball and additional files always

74f7743

Fixed scrolling and reinvented basic page layout

b312882

Fix navbar is hiding

0e282cb

Make templates & static files independent of user's workdir

6b28ebb

Added error handling

881b281

Display invalid entities correctly

e28107b

Added invalid entities highlighting + badges

5b718eb

Added benchmark associations

0f95027

Improved association panel style

444786e

Added association card

e273577

Sorted associations by status / timestamp

eea1e77

Sorted mlcubes and datasets: mine first

7b68911

mhmdk0 had a problem deploying to testing-external-code March 23, 2025 01:42 — with GitHub Actions Failure

change MLCube -> Container, Demo -> Reference, Submit -> Register, an…

ac3d0b3

…d add placeholders + tooltips to forms.

mhmdk0 had a problem deploying to testing-external-code April 3, 2025 21:13 — with GitHub Actions Failure

github-advanced-security bot found potential problems Apr 3, 2025

View reviewed changes

Implement logs/notifications

cdd049f

mhmdk0 had a problem deploying to testing-external-code April 15, 2025 16:19 — with GitHub Actions Failure

front-end refactoring

ed1d25f

mhmdk0 had a problem deploying to testing-external-code April 23, 2025 00:12 — with GitHub Actions Failure

mhmdk0 added 9 commits April 25, 2025 18:51

modals refactoring + modifications for maintainability

73df291

Front end refactoring + Bug fixes

6a26f74

add confirmation popup before running tasks

8c8c6c3

add security check help

db5e4f5

add file/folder browsing

b8ee61b

fix dataset/model association cancellation for web-ui

8a86944

profile activation fix, enhancements

15ea1c4

add notification for prompt / bug fixes

b6fcda9

design changes for associations in benchmark details

4618ab3

mhmdk0 requested a deployment to testing-external-code May 15, 2025 11:30 — with GitHub Actions Waiting

mhmdk0 added 4 commits May 23, 2025 02:29

Merge remote-tracking branch 'origin/main' into web-ui

227c509

update web-ui according to cli changes

c65a0cf

add import/export to web-ui

bc1f3ad

prevent multiple tasks from running - web-ui backend

d0394bf

mhmdk0 requested a deployment to testing-external-code May 26, 2025 20:41 — with GitHub Actions Waiting

github-advanced-security bot found potential problems May 26, 2025

View reviewed changes

change how web-ui display actions depending on entity owner

47a8c33

mhmdk0 requested a deployment to testing-external-code May 27, 2025 14:55 — with GitHub Actions Waiting

improve dataset import and fix its tests

f81a57d

mhmdk0 requested a deployment to testing-external-code May 27, 2025 17:15 — with GitHub Actions Waiting

github-advanced-security bot found potential problems May 27, 2025

View reviewed changes

@@ -21,10 +21,11 @@
             ):
-                full_path = os.path.abspath(os.path.join(BASE_DIR, path))
-                if not os.path.exists(full_path) or not os.path.isdir(full_path):
-                    raise HTTPException(status_code=404, detail="Directory not found")
-                # Ensure path is within the base directory
-                if not os.path.commonpath([BASE_DIR, full_path]) == BASE_DIR:
-                    raise HTTPException(status_code=403, detail="Access denied")
+                # Normalize the path to prevent directory traversal
+                normalized_path = os.path.normpath(path)
+                full_path = os.path.abspath(os.path.join(BASE_DIR, normalized_path))
+                if not full_path.startswith(BASE_DIR):
+                    raise HTTPException(status_code=403, detail="Access denied")
+                if not os.path.exists(full_path) or not os.path.isdir(full_path):
+                    raise HTTPException(status_code=404, detail="Directory not found")

@@ -21,10 +21,10 @@
             ):
-                full_path = os.path.abspath(os.path.join(BASE_DIR, path))
-                if not os.path.exists(full_path) or not os.path.isdir(full_path):
-                    raise HTTPException(status_code=404, detail="Directory not found")
-                # Ensure path is within the base directory
-                if not os.path.commonpath([BASE_DIR, full_path]) == BASE_DIR:
-                    raise HTTPException(status_code=403, detail="Access denied")
+                full_path = os.path.realpath(os.path.join(BASE_DIR, path))
+                # Ensure path is within the base directory
+                if not full_path.startswith(BASE_DIR):
+                    raise HTTPException(status_code=403, detail="Access denied")
+                if not os.path.exists(full_path) or not os.path.isdir(full_path):
+                    raise HTTPException(status_code=404, detail="Directory not found")

@@ -21,10 +21,11 @@
             ):
-                full_path = os.path.abspath(os.path.join(BASE_DIR, path))
-                if not os.path.exists(full_path) or not os.path.isdir(full_path):
-                    raise HTTPException(status_code=404, detail="Directory not found")
-                # Ensure path is within the base directory
-                if not os.path.commonpath([BASE_DIR, full_path]) == BASE_DIR:
-                    raise HTTPException(status_code=403, detail="Access denied")
+                normalized_path = os.path.normpath(path)
+                full_path = os.path.abspath(os.path.join(BASE_DIR, normalized_path))
+                if not os.path.exists(full_path) or not os.path.isdir(full_path):
+                    raise HTTPException(status_code=404, detail="Directory not found")
+                # Ensure path is within the base directory
+                if not os.path.commonpath([BASE_DIR, full_path]) == BASE_DIR:
+                    raise HTTPException(status_code=403, detail="Access denied")
@@ -48,4 +49,7 @@
                 for item in sorted_items:
-                    item_path = os.path.join(full_path, item)
-                    if os.path.isdir(item_path):
+                    item_path = os.path.join(full_path, item)
+                    # Ensure item_path is within full_path
+                    if not os.path.commonpath([full_path, item_path]) == full_path:
+                        continue
+                    if os.path.isdir(item_path):
                         folders.append({"name": item, "path": item_path, "type": "dir"})

@@ -5,4 +5,9 @@
             from medperf.web_ui.common import templates, api_key_cookie
-            router = APIRouter()
+            from urllib.parse import urlparse
+            def is_safe_redirect_url(url: str) -> bool:
+                """Validate that the URL is a relative path or matches allowed hosts."""
+                url = url.replace("\\", "")  # Normalize backslashes
+                parsed = urlparse(url)
+                return not parsed.netloc and not parsed.scheme
@@ -32,6 +37,8 @@
             ):
-                if token == security_token:
-                    response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)
-                    response.set_cookie(key=AUTH_COOKIE_NAME, value=token)
-                    return response
+                if token == security_token:
+                    if not is_safe_redirect_url(redirect_url):
+                        redirect_url = "/"  # Default to home page if validation fails
+                    response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)
+                    response.set_cookie(key=AUTH_COOKIE_NAME, value=token)
+                    return response
                 else:

@@ -34,3 +34,3 @@
                     response = RedirectResponse(url=redirect_url, status_code=status.HTTP_302_FOUND)
-                    response.set_cookie(key=AUTH_COOKIE_NAME, value=token)
+                    response.set_cookie(key=AUTH_COOKIE_NAME, value=token, secure=True, httponly=True, samesite='Lax')
                     return response

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Web UI #634

Web UI #634

Uh oh!

hasan7n commented Dec 21, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Check failure

Copilot Autofix

Check failure

Copilot Autofix

Check failure

Copilot Autofix

Check warning

Check warning

Copilot Autofix

Check warning

Copilot Autofix

Check warning

Copilot Autofix

Check failure

Copilot Autofix

Check failure

Copilot Autofix

Check failure

Copilot Autofix

mhmdk0 commented May 28, 2025

Uh oh!

Uh oh!

@@ -32,10 +32,22 @@
                     # raw_data_path should be provided if the imported dataset is in dev
-                    if self.dataset.state == "DEVELOPMENT" and (
-                        self.raw_data_path is None
-                        or os.path.isfile(self.raw_data_path)
-                        or (os.path.exists(self.raw_data_path) and os.listdir(self.raw_data_path))
-                    ):
-                        raise InvalidArgumentError(
-                            "Output raw data path must be specified and, the directory should be empty or does not exist."
-                        )
+                    if self.dataset.state == "DEVELOPMENT":
+                        if self.raw_data_path is None:
+                            raise InvalidArgumentError(
+                                "Output raw data path must be specified."
+                            )
+                        # Normalize and validate raw_data_path
+                        safe_root = config.raw_data_storage  # Define a safe root directory in the config
+                        normalized_path = str(Path(self.raw_data_path).resolve())
+                        if not normalized_path.startswith(str(Path(safe_root).resolve())):
+                            raise InvalidArgumentError(
+                                f"Invalid raw data path: {self.raw_data_path}. Path must be within {safe_root}."
+                            )
+                        if os.path.isfile(normalized_path) or (
+                            os.path.exists(normalized_path) and os.listdir(normalized_path)
+                        ):
+                            raise InvalidArgumentError(
+                                "Output raw data path must be an empty directory or not exist."
+                            )

@@ -434,3 +434,3 @@
                 try:
-                    ImportDataset.run(dataset_id, input_path, raw_dataset_path)
+                    ImportDataset.run(dataset_id, input_path, raw_dataset_path or config.raw_data_storage)
                     return_response["status"] = "success"

@@ -434,3 +434,8 @@
                 try:
-                    ImportDataset.run(dataset_id, input_path, raw_dataset_path)
+                    # Define a safe root directory for raw_dataset_path
+                    safe_root = config.safe_root  # Safe root directory defined in config
+                    if raw_dataset_path:
+                        raw_dataset_path = str(Path(safe_root).joinpath(raw_dataset_path).resolve())
+                    ImportDataset.run(dataset_id, input_path, raw_dataset_path)
                     return_response["status"] = "success"

Web UI #634

Are you sure you want to change the base?

Web UI #634

Uh oh!

Conversation

hasan7n commented Dec 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Check failure

Uh oh!

Copilot Autofix

Check failure

Uh oh!

Copilot Autofix

Check failure

Uh oh!

Copilot Autofix

Check warning

Uh oh!

Check warning

Uh oh!

Copilot Autofix

Check warning

Copilot Autofix

Check warning

Uh oh!

Copilot Autofix

Check failure

Uh oh!

Copilot Autofix

Check failure

Uh oh!

Copilot Autofix

Check failure

Uh oh!

Copilot Autofix

mhmdk0 commented May 28, 2025

Uh oh!

Uh oh!

hasan7n commented Dec 21, 2024 •

edited

Loading