tag | 87f4e14ace070c9b1fc3eef4e964f0bfc6e14cfa | |
---|---|---|
tagger | The Android Open Source Project <initial-contribution@android.com> | Tue May 16 21:52:56 2023 -0700 |
object | 39e7cc969087d555a9233ad5e63bee5cb8a7b8b2 |
Platform Tools Release 34.0.3 (9979309)
commit | 39e7cc969087d555a9233ad5e63bee5cb8a7b8b2 | [log] [tgz] |
---|---|---|
author | Android Build Coastguard Worker <android-build-coastguard-worker@google.com> | Thu Apr 20 16:01:46 2023 +0000 |
committer | Android Build Coastguard Worker <android-build-coastguard-worker@google.com> | Thu Apr 20 16:01:46 2023 +0000 |
tree | a0d9a4a2ded80048934aa0849e532d4d92a3bc0d | |
parent | 20437486b256b1f014a528899fc7eceecd0ea259 [diff] | |
parent | 355107e119e8f73bc755ae867b32bd55a4874bbf [diff] |
Snap for 9979206 from 355107e119e8f73bc755ae867b32bd55a4874bbf to sdk-release Change-Id: I385c0ef0d3f676958a44cb4dd63bdc9048df59db
The license classifier is a library and set of tools that can analyze text to determine what type of license it contains. It searches for license texts in a file and compares them to an archive of known licenses. These files could be, e.g., LICENSE
files with a single or multiple licenses in it, or source code files with the license text in a comment.
A “confidence level” is associated with each result indicating how close the match was. A confidence level of 1.0
indicates an exact match, while a confidence level of 0.0
indicates that no license was able to match the text.
Adding a new license is straight-forward:
Create a file in licenses/
.
.header
” to it. See licenses/README.md
for more details.Add the license name to the list in license_type.go
.
Regenerate the licenses.db
file by running the license serializer:
$ license_serializer -output licenseclassifier/licenses
Create and run appropriate tests to verify that the license is indeed present.
identify_license
is a command line tool that can identify the license(s) within a file.
$ identify_license LICENSE LICENSE: GPL-2.0 (confidence: 1, offset: 0, extent: 14794) LICENSE: LGPL-2.1 (confidence: 1, offset: 18366, extent: 23829) LICENSE: MIT (confidence: 1, offset: 17255, extent: 1059)
The license_serializer
tool regenerates the licenses.db
archive. The archive contains preprocessed license texts for quicker comparisons against unknown texts.
$ license_serializer -output licenseclassifier/licenses
This is not an official Google product (experimental or otherwise), it is just code that happens to be owned by Google.