To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????遙??[??????遙??[^ 0011111100111111001111110011111100111111001111111110101010100001001111110011111101011011001111110011111100111111001111110011111100111111111010101010000100111111001111110101101101011110 3f3f3f3f3f3feaa13f3f5b3f3f3f3f3f3feaa13f3f5b5e
EUC-JP 旿?????遙??[旿?????遙??[^ 100011111100000111110100001111110011111100111111001111110011111111110100101000110011111100111111010110111000111111000001111101000011111100111111001111110011111100111111111101001010001100111111001111110101101101011110 8fc1f43f3f3f3f3ff4a33f3f5b8fc1f43f3f3f3f3ff4a33f3f5b5e
UTF-8 旿⑼쉭遼섓쉥遙뤄쉐[旿⑼쉭遼섓쉥遙뤄쉐[^ 111001101001011110111111111000101001000110111100111011001000100110101101111011111010011110000011111011001000010010010011111011001000100110100101111010011000000110011001111010111010010010000100111011001000100110010000010110111110011010010111101111111110001010010001101111001110110010001001101011011110111110100111100000111110110010000100100100111110110010001001101001011110100110000001100110011110101110100100100001001110110010001001100100000101101101011110 e697bfe291bcec89adefa783ec8493ec89a5e98199eba484ec89905be697bfe291bcec89adefa783ec8493ec89a5e98199eba484ec89905b5e
UHC 旿⑼쉭遼섓쉥遙뤄쉐[旿⑼쉭遼섓쉥遙뤄쉐[^ 111001111111101010101001111011111011110110101101111010011010110010011000111011111011110110101011111010011010101110110111111011111011110110100110010110111110011111111010101010011110111110111101101011011110100110101100100110001110111110111101101010111110100110101011101101111110111110111101101001100101101101011110 e7faa9efbdade9ac98efbdabe9abb7efbda65be7faa9efbdade9ac98efbdabe9abb7efbda65b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)