What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??孺??循??齬??而??循??億?? 11101010010111110011111100111111100110110111110100111111001111111000111101111010001111110011111111101010100101110011111100111111100011101010011100111111001111111000111101111010001111110011111110001001101011010011111100111111 ea5f3f3f9b7d3f3f8f7a3f3fea973f3f8ea73f3f8f7a3f3f89ad3f3f
EUC-JP 鸚??孺??循??齬??而??循??億?? 11110011110000000011111100111111110101011101111000111111001111111011110111011011001111110011111111110011111101110011111100111111101111001010100100111111001111111011110111011011001111110011111110110010101011110011111100111111 f3c03f3fd5de3f3fbddb3f3ff3f73f3fbca93f3fbddb3f3fb2af3f3f
UTF-8 鸚쒓퍔孺욘쾮循녿겱齬잆굦而숁듉循뗪턂億됱슣 111010011011100010011010111011001001001010010011111011011000110110010100111001011010110110111010111011001001101010011000111011001011111010101110111001011011111010101010111010111000010110111111111010101011001010110001111010011011110110101100111011001001111010000110111010101011010110100110111010001000000010001100111011001000100010000001111010111001001110001001111001011011111010101010111010111001011110101010111011011000010010000010111001011000010010000100111010111001000010110001111011001000101010100011 e9b89aec9293ed8d94e5adbaec9a98ecbeaee5beaaeb85bfeab2b1e9bdacec9e86eab5a6e8808cec8881eb9389e5beaaeb97aaed8482e58484eb90b1ec8aa3
UHC 鸚쒓퍔孺욘쾮循녿겱齬잆굦而숁듉循뗪턂億됱슣 111001011010010010011100111010101011101110001011111010101110100010111111111001101011001010000101111000101110000010000110111010111000000110111101111001011110000110011111111000111000001010001100111011001011101110011001111001101000101010111100111000101110000010001011111010101011010110011110111001011110001010001001111011001001101010101111 e5a49ceabb8beae8bfe6b285e2e086eb81bde5e19fe3828cecbb99e68abce2e08beab59ee5e289ec9aaf

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
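
The conversion shown in the table can be reproduced offline with a short script. What follows is a minimal Python sketch, not the code this page runs. The `CODECS` mapping from the table's charset labels to Python codec names is an assumption (in particular, `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which mirrors the `3f` bytes visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("A").items():
        print(label, bits, hex_string)
```

For example, `encode_row("A", "utf-8")` yields `("01000001", "41")`, because "A" is the single byte 0x41 in every charset listed here.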