We propose a method to mark the comments of license as sentence-token. We use the term sentence-token to refer to a sentence of a known license. A license (both by-inclusion or by-reference) is a sequence of sentence-tokens. Sentence-tokens are generalized using one or more regular expressions. we propose an idea for license identification based on the analysis of each sentence in the license statement of a source code file.