To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 髫カ迹夲スス貂ョ逸晞垳迹夲スス貂ョ逸嫂 11101001100110101011011011100111100100011001101011101111101111011011110111100110101110001010111011111011101101001001110111101001100110101011011011100111100100011001101011101111101111011011110111100110101110001010111011111011101101001001101101011110 e99ab6e7919aefbdbde6b8aefbb49de99ab6e7919aefbdbde6b8aefbb49b5e
EUC-JP 髫カ迹夲スス貂ョ?晞垳迹夲スス貂ョ?嫂 111100011111101010001110101101101110110111110001110101001111000110001110101111011000111010111101111011001011101010001110101011100011111111011010111010111101010010111000111011011111000111010100111100011000111010111101100011101011110111101100101110101000111010101110001111111101010110111111 f1fa8eb6edf1d4f18ebd8ebdecba8eae3fdaebd4b8edf1d4f18ebd8ebdecba8eae3fd5bf
UTF-8 髫カ迹夲スス貂ョ逸晞垳迹夲スス貂ョ逸嫂 111010011010101110101011111011111011110110110110111010001011111110111001111001011010010010110010111011111011110110111101111011111011110110111101111010001011001010000010111011111011110110101110111011111010100010100101111001101001100110011110111001011001111010110011111010001011111110111001111001011010010010110010111011111011110110111101111011111011110110111101111010001011001010000010111011111011110110101110111011111010100010100101111001011010101110000010 e9ababefbdb6e8bfb9e5a4b2efbdbdefbdbde8b282efbdaeefa8a5e6999ee59eb3e8bfb9e5a4b2efbdbdefbdbde8b282efbdaeefa8a5e5ab82
UHC ??迹???貂??晞?迹???貂??嫂 00111111001111111110111011101001001111110011111100111111111101011011000000111111001111111111110111110101001111111110111011101001001111110011111100111111111101011011000000111111001111111110000111111001 3f3feee93f3f3ff5b03f3ffdf53feee93f3f3ff5b03f3fe1f9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)