To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?弔???伊豆?矜?製?制 100111111100010000111111100100101010001000111111001111110011111110001000110010011001001110100100001111111110000111100000001111111001000010111011001111111001000010100111 9fc43f92a23f3f3f88c993a43fe1e03f90bb3f90a7
EUC-JP 淨?弔???伊豆?矜?製?制 110111101100011000111111110001001010010000111111001111110011111110110000110010111100011010100110001111111110001011100010001111111100000010111101001111111100000010101001 dec63fc4a43f3f3fb0cbc6a63fe2e23fc0bd3fc0a9
UTF-8 淨렠弔렟罹렗伊豆렚矜썬製렩制 111001101011011110101000111010111010000010100000111001011011110010010100111010111010000010011111111011111010011110100110111010111010000010010111111001001011110010001010111010001011000110000110111010111010000010011010111001111001111110011100111011001000110110101100111010001010001110111101111010111010000010101001111001011000100010110110 e6b7a8eba0a0e5bc94eba09fefa7a6eba097e4bc8ae8b186eba09ae79f9cec8dace8a3bdeba0a9e588b6
UHC 淨렠弔렟罹렗伊豆렚矜썬製렩制 11101111111001001000111010110001111100001100000010001110101100001110110010111010100011101010110011101100101001011101010011100111100011101010110111010000111010001011110111100011111100001011001010001110101101111111000010100100 efe48eb1f0c08eb0ecba8eaceca5d4e78eadd0e8bde3f0b28eb7f0a4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)