To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 臟棒?牆?臟棒?牆?[臟棒?牆?臟棒?牆?[^ 1110010001100110100101100101111100111111111000001010110100111111111001000110011010010110010111110011111111100000101011010011111101011011111001000110011010010110010111110011111111100000101011010011111111100100011001101001011001011111001111111110000010101101001111110101101101011110 e466965f3fe0ad3fe466965f3fe0ad3f5be466965f3fe0ad3fe466965f3fe0ad3f5b5e
EUC-JP 臟棒?牆?臟棒?牆?[臟棒?牆?臟棒?牆?[^ 1110011111000111110010111100000000111111111000001010111100111111111001111100011111001011110000000011111111100000101011110011111101011011111001111100011111001011110000000011111111100000101011110011111111100111110001111100101111000000001111111110000010101111001111110101101101011110 e7c7cbc03fe0af3fe7c7cbc03fe0af3f5be7c7cbc03fe0af3fe7c7cbc03fe0af3f5b5e
UTF-8 臟棒치牆렪臟棒치牆렪[臟棒치牆렪臟棒치牆렪[^ 111010001000011110011111111001101010001110010010111011001011100110011000111001111000100110000110111010111010000010101010111010001000011110011111111001101010001110010010111011001011100110011000111001111000100110000110111010111010000010101010010110111110100010000111100111111110011010100011100100101110110010111001100110001110011110001001100001101110101110100000101010101110100010000111100111111110011010100011100100101110110010111001100110001110011110001001100001101110101110100000101010100101101101011110 e8879fe6a392ecb998e78986eba0aae8879fe6a392ecb998e78986eba0aa5be8879fe6a392ecb998e78986eba0aae8879fe6a392ecb998e78986eba0aa5b5e
UHC 臟棒치牆렪臟棒치牆렪[臟棒치牆렪臟棒치牆렪[^ 11101101111101001101110011101010110001001010000111101101111011011000111010111000111011011111010011011100111010101100010010100001111011011110110110001110101110000101101111101101111101001101110011101010110001001010000111101101111011011000111010111000111011011111010011011100111010101100010010100001111011011110110110001110101110000101101101011110 edf4dceac4a1eded8eb8edf4dceac4a1eded8eb85bedf4dceac4a1eded8eb8edf4dceac4a1eded8eb85b5e

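SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a character set cannot represent appear as "?" (hexadecimal 3f) in the table.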
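
The conversion in the table can be approximated with a short Python sketch. This is not the code behind this page; it only illustrates the idea. The helper names (`encode_bits`, `convert`) and the mapping from the table's charset labels to Python codec names are assumptions, in particular SJIS-WIN to `cp932` and UHC to `cp949`. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexa) in convert("棒").items():
        print(name, binary, hexa)
```

Under these assumptions, the character 棒 yields e6a392 in UTF-8 and 3f in ISO-8859-1, consistent with the corresponding bytes in the table rows.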