To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥〓?二ゆ?癒??嚥〓?二ゆ?癒⑤?^ 100110101000101110000001101011000011111110010011111100011000001011100100001111111001011011111100001111110011111110011010100010111000000110101100001111111001001111110001100000101110010000111111100101101111110010000111010001000011111101011110 9a8b81ac3f93f182e43f96fc3f3f9a8b81ac3f93f182e43f96fc87443f5e
EUC-JP 嚥〓?二ゆ?癒??嚥〓?二ゆ?癒??^ 1101001111101011101000101010111000111111110001101111001110100100111001100011111111001100111111100011111100111111110100111110101110100010101011100011111111000110111100111010010011100110001111111100110011111110001111110011111101011110 d3eba2ae3fc6f3a4e63fccfe3f3fd3eba2ae3fc6f3a4e63fccfe3f3f5e
UTF-8 嚥〓ㅏ二ゆ에癒ⓦ돧嚥〓ㅏ二ゆ에癒⑤뤁^ 11100101100110101010010111100011100000001001001111100011100001011000111111100100101110101000110011100011100000101000011011101100100101111001000011100111100110011001001011100010100100111010011011101011100011111010011111100101100110101010010111100011100000001001001111100011100001011000111111100100101110101000110011100011100000101000011011101100100101111001000011100111100110011001001011100010100100011010010011101011101001001000000101011110 e59aa5e38093e3858fe4ba8ce38286ec9790e79992e293a6eb8fa7e59aa5e38093e3858fe4ba8ce38286ec9790e79992e291a4eba4815e
UHC 嚥〓ㅏ二ゆ에癒ⓦ돧嚥〓ㅏ二ゆ에癒⑤뤁^ 11100110101111111010000111101011101001001011111111101100101000111010101011100110101111111010000111101011101010001010100011100011100010011010101111100110101111111010000111101011101001001011111111101100101000111010101011100110101111111010000111101011101010001010100011101011100011111011001001011110 e6bfa1eba4bfeca3aae6bfa1eba8a8e389abe6bfa1eba4bfeca3aae6bfa1eba8a8eb8fb25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)