To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?z]]nf?z]]n^}Y?z]]nf?z]]n^}bE 0011111101111010010111010101110101101110011001100011111101111010010111010101110101101110010111100111110101011001001111110111101001011101010111010110111001100110001111110111101001011101010111010110111001011110011111010110001001000101 3f7a5d5d6e663f7a5d5d6e5e7d593f7a5d5d6e663f7a5d5d6e5e7d6245
SJIS-WIN 杜z]]nf杜z]]n^}Y杜z]]nf杜z]]n^}bE 100100110110110101111010010111010101110101101110011001101001001101101101011110100101110101011101011011100101111001111101010110011001001101101101011110100101110101011101011011100110011010010011011011010111101001011101010111010110111001011110011111010110001001000101 936d7a5d5d6e66936d7a5d5d6e5e7d59936d7a5d5d6e66936d7a5d5d6e5e7d6245
EUC-JP 杜z]]nf杜z]]n^}Y杜z]]nf杜z]]n^}bE 110001011100111001111010010111010101110101101110011001101100010111001110011110100101110101011101011011100101111001111101010110011100010111001110011110100101110101011101011011100110011011000101110011100111101001011101010111010110111001011110011111010110001001000101 c5ce7a5d5d6e66c5ce7a5d5d6e5e7d59c5ce7a5d5d6e66c5ce7a5d5d6e5e7d6245
UTF-8 杜z]]nf杜z]]n^}Y杜z]]nf杜z]]n^}bE 11100110100111011001110001111010010111010101110101101110011001101110011010011101100111000111101001011101010111010110111001011110011111010101100111100110100111011001110001111010010111010101110101101110011001101110011010011101100111000111101001011101010111010110111001011110011111010110001001000101 e69d9c7a5d5d6e66e69d9c7a5d5d6e5e7d59e69d9c7a5d5d6e66e69d9c7a5d5d6e5e7d6245
UHC 杜z]]nf杜z]]n^}Y杜z]]nf杜z]]n^}bE 110101001110000101111010010111010101110101101110011001101101010011100001011110100101110101011101011011100101111001111101010110011101010011100001011110100101110101011101011011100110011011010100111000010111101001011101010111010110111001011110011111010110001001000101 d4e17a5d5d6e66d4e17a5d5d6e5e7d59d4e17a5d5d6e66d4e17a5d5d6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)