To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 甑???禎???程?????禎???程?泣 100011011001100100111111001111110011111110010010111101010011111100111111001111111001001011110110001111110011111100111111001111110011111110010010111101010011111100111111001111111001001011110110001111111000101110000011 8d993f3f3f92f53f3f3f92f63f3f3f3f3f92f53f3f3f92f63f8b83
EUC-JP 甑???禎???程?釪?檉?禎???程?泣 10111001111110010011111100111111001111111100010011110111001111110011111100111111110001001111100000111111100011111110001110101101001111111000111111000101101110110011111111000100111101110011111100111111001111111100010011111000001111111011010111100011 b9f93f3f3fc4f73f3f3fc4f83f8fe3ad3f8fc5bb3fc4f73f3f3fc4f83fb5e3
UTF-8 甑희렰렓禎뀜렰렧程렣釪렟檉렢禎뀜렰렧程렣泣 111001111001010010010001111011011001110110101100111010111010000010110000111010111010000010010011111001111010011010001110111010111000000010011100111010111010000010110000111010111010000010100111111001111010100010001011111010111010000010100011111010011000011110101010111010111010000010011111111001101010101010001001111010111010000010100010111001111010011010001110111010111000000010011100111010111010000010110000111010111010000010100111111001111010100010001011111010111010000010100011111001101011001110100011 e79491ed9daceba0b0eba093e7a68eeb809ceba0b0eba0a7e7a88beba0a3e987aaeba09fe6aa89eba0a2e7a68eeb809ceba0b0eba0a7e7a88beba0a3e6b3a3
UHC 甑희렰렓禎뀜렰렧程렣釪렟檉렢禎뀜렰렧程렣泣 111100011111011111001000111100011000111010111101100011101010100011101111111011101011001011110001100011101011110110001110101101101110111111101111100011101011010011101001111010011000111010110000111011111110000010001110101100111110111111101110101100101111000110001110101111011000111010110110111011111110111110001110101101001110101111101000 f1f7c8f18ebd8ea8efeeb2f18ebd8eb6efef8eb4e9e98eb0efe08eb3efeeb2f18ebd8eb6efef8eb4ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)