Skip to content

Commit ae20459

Browse files
committedSep 1, 2024··
Code Overhaul
PHP 8.1 is now required
1 parent e7b4e27 commit ae20459

15 files changed

+1811
-1681
lines changed
 

‎.github/workflows/Test.yml

+8-7
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,3 @@
1-
---
21
name: Test
32

43
on:
@@ -21,13 +20,13 @@ jobs:
2120
runs-on: ubuntu-latest
2221
strategy:
2322
matrix:
24-
php: [ '7.2', '7.3', '7.4', '8.0', '8.1' ]
25-
continue-on-error: ${{ matrix.php == '8.1' }}
23+
php: [ '8.1', '8.2', '8.3', '8.4' ]
24+
continue-on-error: ${{ matrix.php == '8.4' }}
2625
name: PHP ${{ matrix.php }} Test
2726

2827
steps:
2928
- name: Git checkout
30-
uses: actions/checkout@v2
29+
uses: actions/checkout@v4
3130

3231
- name: Checkout submodules
3332
run: git submodule update --init --recursive
@@ -47,10 +46,12 @@ jobs:
4746
run: composer update --prefer-dist --no-interaction
4847

4948
- name: Run phpcs
50-
run: composer cs-check -- -v
49+
run: composer cs-check
5150

5251
- name: Execute tests
53-
run: composer test -- -v --coverage-clover=coverage.xml
52+
run: composer test -- --coverage-clover=coverage.xml
5453

5554
- name: Run codecov
56-
uses: codecov/codecov-action@v1
55+
uses: codecov/codecov-action@v4
56+
with:
57+
token: ${{ secrets.CODECOV_TOKEN }}

‎.gitignore

+1
Original file line numberDiff line numberDiff line change
@@ -4,5 +4,6 @@ composer.lock
44
.idea
55
phpunit.xml
66
.phpunit.result.cache
7+
/.phpunit.cache
78
.phpcs-cache
89
.php_cs.cache

‎README.md

+44-62
Original file line numberDiff line numberDiff line change
@@ -1,115 +1,97 @@
11
# PHP Mime Detector
22

3-
Detecting the real type of (binary) files doesn't have to be hard. Checking a file's extension is not reliable and can cause serious security issues.
3+
Detecting the real type of (binary) files doesn't have to be difficult. Checking a file's extension is not reliable and can lead to serious security issues.
44

5-
This package helps you to determine the correct type of files, by reading it byte for byte (up to 4096) and check for [magic numbers](http://en.wikipedia.org/wiki/Magic_number_(programming)#Magic_numbers_in_files).
5+
This package helps you determine the correct type of files by reading them byte by byte (up to 4096 bytes) and checking for [magic numbers](http://en.wikipedia.org/wiki/Magic_number_(programming)#Magic_numbers_in_files).
66

7-
However, this package isn't a replacement for any security software. It just aims to produce less false positives, than a simple extension check would produce.
7+
However, this package isn't a replacement for any security software. It simply aims to produce fewer false positives than a simple extension check would.
88

99
A list of supported file types can be found on [this Wiki page](https://github.com/SoftCreatR/php-mime-detector/wiki/Supported-file-types).
1010

11-
__Why a separate class?__
11+
## Why a Separate Class?
1212

13-
You may wonder why we don't just rely on extensions like [Fileinfo](https://secure.php.net/manual/en/book.fileinfo.php)? First off all, a short background story:
13+
You may wonder why we don't just rely on extensions like [Fileinfo](https://www.php.net/manual/en/book.fileinfo.php). Here's a brief background:
1414

15-
We are building extensions and applications for an Open Source PHP Framework, thus we are creating web software for the masses. Many of our customers and/or users of our free products are on shared hosting without any possibility to install or manage installed PHP extensions. So our goal is to develop solutions with as few dependencies as necessary, but with as much functionality as possible.
15+
We develop extensions and applications for an Open Source PHP Framework, creating web software for the masses. Many of our customers and users of our free products are on shared hosting without any ability to install or manage PHP extensions. Therefore, our goal is to develop solutions with minimal dependencies while providing as much functionality as possible.
1616

17-
While developing a solution, that allows people to convert HEIF/HEIC files to a more "standardized" format (using our own, external API), we had troubles detecting these files, because especially this format isn't known by most Webservers, yet. While checking the file extension isn't reliable, we had to find a reusable solution that works for most of our clients. So we started to build a magic number based check for these files. That was the birth of our Mime Detector.
18-
19-
__Why are the unit tests so poorly written?__
20-
21-
Short answer: I have just little experience in unit testing. This project was a good training and even if the unit tests could be better: I am proud of my progress :)
22-
23-
## Demo
24-
25-
A demo (based on [dev-master](https://github.com/SoftCreatR/php-mime-detector/archive/master.zip)) can be found at [WhatTheFile.info](https://www.whatthefile.info).
17+
When developing a solution that allows people to convert HEIF/HEIC files to a more "standardized" format (using our own external API), we had trouble detecting these files because this format isn't widely recognized by most web servers. Since checking the file extension isn't reliable, we needed to find a reusable solution that works for most of our clients. This led to the creation of our Mime Detector, based on magic number checks.
2618

2719
## Requirements
2820

29-
- PHP 7.1 or newer
30-
- [Composer](https://getcomposer.org)
31-
32-
If you are looking for a solution that works on older PHP versions (5.3.2+), head over to the [oldphp](https://github.com/SoftCreatR/php-mime-detector/tree/oldphp) branch.
21+
- PHP 8.1 or newer
22+
- [Composer](https://getcomposer.org)
3323

3424
## Installation
3525

36-
Require this package using [Composer](https://getcomposer.org/), in the root directory of your project:
26+
Install this package using [Composer](https://getcomposer.org/) in the root directory of your project:
3727

38-
``` bash
39-
$ composer require softcreatr/php-mime-detector
28+
```bash
29+
composer require softcreatr/php-mime-detector
4030
```
4131

4232
## Usage
4333

44-
Here is an example on how this package makes it very easy to determine the mime type (and it's corresponding file extension) of a given file:
34+
Here is an example of how this package makes it easy to determine the MIME type and corresponding file extension of a given file:
4535

4636
```php
47-
use SoftCreatR\MimeDetector\MimeDetector;
48-
use SoftCreatR\MimeDetector\MimeDetectorException;
49-
50-
// create an instance of the MimeDetector
51-
$mimeDetector = new MimeDetector();
52-
53-
// set our file to read
54-
try {
55-
$mimeDetector->setFile('foo.bar');
56-
} catch (MimeDetectorException $e) {
57-
die('An error occurred while trying to load the given file.');
58-
}
37+
<?php
5938

60-
// try to determine it's mime type and the correct file extension
61-
$fileData = $mimeDetector->getFileType();
62-
63-
// print the result
64-
echo '<pre>' . print_r($fileData, true) . '</pre>';
65-
```
66-
67-
Or short:
68-
69-
```php
7039
use SoftCreatR\MimeDetector\MimeDetector;
7140
use SoftCreatR\MimeDetector\MimeDetectorException;
7241

42+
require 'vendor/autoload.php';
43+
7344
try {
74-
echo '<pre>' . print_r((new MimeDetector())->setFile('foo.bar')->getFileType(), true) . '</pre>';
45+
// Create an instance of MimeDetector with the file path
46+
$mimeDetector = new MimeDetector('foo.bar');
47+
48+
// Get the MIME type and file extension
49+
$fileData = [
50+
'mime_type' => $mimeDetector->getMimeType(),
51+
'file_extension' => $mimeDetector->getFileExtension(),
52+
'file_hash' => $mimeDetector->getFileHash(),
53+
];
54+
55+
// Print the result
56+
echo '<pre>' . print_r($fileData, true) . '</pre>';
7557
} catch (MimeDetectorException $e) {
76-
die('An error occurred while trying to load the given file.');
58+
die('An error occurred while trying to load the given file: ' . $e->getMessage());
7759
}
7860
```
7961

8062
## Testing
8163

82-
Testing utilizes PHPUnit (what else?) by running this command:
64+
This project uses PHPUnit for testing. To run tests, use the following command:
8365

84-
``` bash
85-
$ composer test
66+
```bash
67+
composer test
8668
```
8769

88-
However, you may check out a bunch of test files for a full test. Test files are no longer included in the composer package nor the Git repository itself, so you have to perform a checkout of this repository and install it's submodules:
70+
To run a full test suite, you can use a set of provided test files. These files are not included in the Composer package or Git repository, so you must clone this repository and initialize its submodules:
8971

90-
``` bash
91-
$ git clone https://github.com/SoftCreatR/php-mime-detector
92-
$ cd php-mime-detector
93-
$ git submodule update --init --recursive
72+
```bash
73+
git clone https://github.com/SoftCreatR/php-mime-detector
74+
cd php-mime-detector
75+
git submodule update --init --recursive
9476
```
9577

96-
When done, perform a `composer install` and run PHPUnit as described above.
78+
After that, install the necessary dependencies with `composer install`, and run PHPUnit as described above.
9779

9880
## ToDo
9981

100-
- Reduce method sizes, when possible
101-
- Add a method, that accepts a mime type and returns the corresponding file extension
102-
- Add a method, that accepts a file extension and returns a list of corresponding mime types
103-
- Add a method, that returns a list of all detectable mime types and their corresponding file extensions
82+
- Reduce method sizes where possible.
83+
- Add a method that accepts a MIME type and returns the corresponding file extension.
84+
- Add a method that accepts a file extension and returns a list of corresponding MIME types.
85+
- Add a method that returns a list of all detectable MIME types and their corresponding file extensions.
10486

10587
## Contributing
10688

10789
Please see [CONTRIBUTING](CONTRIBUTING.md) for details.
10890

109-
When adding new detections, please make sure to provide at least one sample file.
91+
When adding new detections, please provide at least one sample file to ensure the detection works as expected.
11092

11193
## License
11294

113-
[ISC](LICENSE.md)
95+
[ISC License](LICENSE.md)
11496

11597
Free Software, Hell Yeah!

‎composer.json

+5-5
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88
"Magic Number"
99
],
1010
"homepage": "https://1-2.dev",
11-
"version": "3.2.1",
11+
"version": "4.0.0",
1212
"license": "ISC",
1313
"authors": [
1414
{
@@ -21,12 +21,12 @@
2121
}
2222
],
2323
"require": {
24-
"php": ">=7.1.0"
24+
"php": ">=8.1.0"
2525
},
2626
"require-dev": {
27-
"phpunit/phpunit": ">=7.0",
28-
"roave/security-advisories": "dev-master",
29-
"squizlabs/php_codesniffer": "^3.5"
27+
"phpunit/phpunit": ">=10.0",
28+
"roave/security-advisories": "dev-latest",
29+
"squizlabs/php_codesniffer": "^3.7"
3030
},
3131
"config": {
3232
"optimize-autoloader": true,

‎phpunit.xml.dist

+11-15
Original file line numberDiff line numberDiff line change
@@ -1,17 +1,13 @@
11
<?xml version="1.0" encoding="UTF-8"?>
2-
<phpunit xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
3-
xsi:noNamespaceSchemaLocation="./vendor/phpunit/phpunit/phpunit.xsd"
4-
bootstrap="vendor/autoload.php"
5-
colors="true"
6-
>
7-
<testsuites>
8-
<testsuite name="Unit">
9-
<directory>./tests</directory>
10-
</testsuite>
11-
</testsuites>
12-
<filter>
13-
<whitelist>
14-
<directory suffix=".php">./src/</directory>
15-
</whitelist>
16-
</filter>
2+
<phpunit xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="./vendor/phpunit/phpunit/phpunit.xsd" bootstrap="vendor/autoload.php" colors="true" cacheDirectory=".phpunit.cache" displayDetailsOnTestsThatTriggerWarnings="true">
3+
<testsuites>
4+
<testsuite name="Unit">
5+
<directory>./tests</directory>
6+
</testsuite>
7+
</testsuites>
8+
<source>
9+
<include>
10+
<directory>./src/</directory>
11+
</include>
12+
</source>
1713
</phpunit>
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,119 @@
1+
<?php
2+
3+
/**
4+
* Mime Detector for PHP.
5+
*
6+
* @license https://github.com/SoftCreatR/php-mime-detector/blob/main/LICENSE ISC License
7+
*/
8+
9+
declare(strict_types=1);
10+
11+
namespace SoftCreatR\MimeDetector;
12+
13+
class ByteCacheHandler
14+
{
15+
private array $byteCache = [];
16+
17+
private int $byteCacheLen = 0;
18+
19+
private int $maxByteCacheLen = 4096;
20+
21+
/**
22+
* @throws MimeDetectorException
23+
*/
24+
public function __construct(private readonly string $file)
25+
{
26+
$this->createByteCache();
27+
}
28+
29+
public function getByteCache(): array
30+
{
31+
return $this->byteCache;
32+
}
33+
34+
public function getByteCacheLen(): int
35+
{
36+
return $this->byteCacheLen;
37+
}
38+
39+
/**
40+
* @throws MimeDetectorException
41+
*/
42+
public function setMaxByteCacheLen(int $maxLength): void
43+
{
44+
if ($maxLength < 4) {
45+
throw new MimeDetectorException('Maximum byte cache length must not be smaller than 4.');
46+
}
47+
$this->maxByteCacheLen = $maxLength;
48+
}
49+
50+
public function getMaxByteCacheLen(): int
51+
{
52+
return $this->maxByteCacheLen;
53+
}
54+
55+
/**
56+
* @throws MimeDetectorException
57+
*/
58+
private function createByteCache(): void
59+
{
60+
if (empty($this->file)) {
61+
throw new MimeDetectorException('No file provided.');
62+
}
63+
64+
$handle = \fopen($this->file, 'rb');
65+
$data = \fread($handle, $this->maxByteCacheLen);
66+
\fclose($handle);
67+
68+
foreach (\str_split($data) as $i => $char) {
69+
$this->byteCache[$i] = \ord($char);
70+
}
71+
72+
$this->byteCacheLen = \count($this->byteCache);
73+
}
74+
75+
public function checkForBytes(array $bytes, int $offset = 0, array $mask = []): bool
76+
{
77+
if (empty($bytes) || empty($this->byteCache)) {
78+
return false;
79+
}
80+
81+
foreach (\array_values($bytes) as $i => $byte) {
82+
if (!empty($mask)) {
83+
if (
84+
!isset($this->byteCache[$offset + $i], $mask[$i])
85+
|| $byte !== ($mask[$i] & $this->byteCache[$offset + $i])
86+
) {
87+
return false;
88+
}
89+
} elseif (!isset($this->byteCache[$offset + $i]) || $this->byteCache[$offset + $i] !== $byte) {
90+
return false;
91+
}
92+
}
93+
94+
return true;
95+
}
96+
97+
public function searchForBytes(array $bytes, int $offset = 0, array $mask = []): int
98+
{
99+
$limit = $this->byteCacheLen - \count($bytes);
100+
101+
for ($i = $offset; $i < $limit; $i++) {
102+
if ($this->checkForBytes($bytes, $i, $mask)) {
103+
return $i;
104+
}
105+
}
106+
107+
return -1;
108+
}
109+
110+
public function checkString(string $str, int $offset = 0): bool
111+
{
112+
return $this->checkForBytes($this->toBytes($str), $offset);
113+
}
114+
115+
public function toBytes(string $str): array
116+
{
117+
return \array_values(\unpack('C*', $str));
118+
}
119+
}

0 commit comments

Comments
 (0)
Please sign in to comment.