Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜??音?????????????? 11100001100111110011111110000010010101111000101101011000001111110011111110001001101110010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 e19f3f82578b583f3f89b93f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP 癲?8宜??音??孼?????????瑗? 1110001010100001001111111010001110111000101101011011100100111111001111111011001010111011001111110011111110001111101110101100001100111111001111110011111100111111001111110011111100111111001111110011111110001111110011001100000000111111 e2a13fa3b8b5b93f3fb2bb3f3f8fbac33f3f3f3f3f3f3f3f3f8fccc03f
UTF-8 癲쒕8宜룩눧音쀬뵯孼꾊랁룍嶺뚯쉶理껅꼮瑗뢊 111001111001100110110010111011001001001010010101111011111011110010011000111001011010111010011100111010111010001110101001111010111000100010100111111010011001111110110011111011001000000010101100111010111011010110101111111001011010110110111100111010101011111010001010111010111001111010000001111010111010001110001101111011111010011010101011111010111001101010101111111011001000100110110110111011111010011110100100111010101011101110000101111010101011110010101110111001111001000110010111111010111010001010001010 e799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ec80acebb5afe5adbceabe8aeb9e81eba38defa6abeb9aafec89b6efa7a4eabb85eabcaee79197eba28a
UHC 癲쒕8宜룩눧音쀬뵯孼꾊랁룍嶺뚯쉶理껅꼮瑗뢊 111011111010011010011100111010111010001110111000111010111111000110110111111010001000011110111110111010111110010110010111111011001001010010101101111001011110110110000100110100011000110111101101100011111000101111100111101011011000110011101100100110101000110011101100101101011000001111100110100001001000100111101010101111001000111101000110 efa69ceba3b8ebf1b7e887beebe597ec94ade5ed84d18ded8f8be7ad8cec9a8cecb583e68489eabc8f46

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
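
The conversion shown in the table above can be approximated with a few lines of Python. This is only a minimal sketch, not the tool's actual implementation: the function name `encode_table` is hypothetical, the Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to correspond to the charsets listed, and replacing unencodable characters with "?" (byte 3f) is an assumption based on the 3f bytes visible in the table.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary bit string, hex string) for each charset.

    Characters a charset cannot represent are replaced with "?" (0x3f);
    this replacement policy is an assumption, not confirmed by the tool.
    """
    rows = []
    for name in charsets:
        raw = text.encode(name, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("あ"):
        print(name, bits, hexstr)
```

For example, the hiragana "あ" cannot be represented in ISO-8859-1 and so comes back as a single 3f byte there, while the Japanese charsets and UTF-8 each produce their own multi-byte sequence.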