To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 傲??臆??餓??[傲??臆??餓??[^ 100110001111110000111111001111111000100110110000001111110011111110001001111011000011111100111111010110111001100011111100001111110011111110001001101100000011111100111111100010011110110000111111001111110101101101011110 98fc3f3f89b03f3f89ec3f3f5b98fc3f3f89b03f3f89ec3f3f5b5e
EUC-JP 傲??臆??餓??[傲??臆??餓??[^ 110100001111111000111111001111111011001010110010001111110011111110110010111011100011111100111111010110111101000011111110001111110011111110110010101100100011111100111111101100101110111000111111001111110101101101011110 d0fe3f3fb2b23f3fb2ee3f3f5bd0fe3f3fb2b23f3fb2ee3f3f5b5e
UTF-8 傲ⓧ뻑臆롨렓餓뜻닅[傲ⓧ뻑臆롨렓餓뜻닅[^ 111001011000001010110010111000101001001110100111111010111011101110010001111010001000011110000110111010111010000110101000111010111010000010010011111010011010010010010011111010111001110010111011111010111000101110000101010110111110010110000010101100101110001010010011101001111110101110111011100100011110100010000111100001101110101110100001101010001110101110100000100100111110100110100100100100111110101110011100101110111110101110001011100001010101101101011110 e582b2e293a7ebbb91e88786eba1a8eba093e9a493eb9cbbeb8b855be582b2e293a7ebbb91e88786eba1a8eba093e9a493eb9cbbeb8b855b5e
UHC 傲ⓧ뻑臆롨렓餓뜻닅[傲ⓧ뻑臆롨렓餓뜻닅[^ 111001111110110010101000111001001011101110110110111001011110011010001110111010001000111010101000111001001011101110110110111001101000100010001110010110111110011111101100101010001110010010111011101101101110010111100110100011101110100010001110101010001110010010111011101101101110011010001000100011100101101101011110 e7eca8e4bbb6e5e68ee88ea8e4bbb6e6888e5be7eca8e4bbb6e5e68ee88ea8e4bbb6e6888e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)