To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 歪?????齬??[歪?????齬??[^ 10011000011000110011111100111111001111110011111100111111111010101001011100111111001111110101101110011000011000110011111100111111001111110011111100111111111010101001011100111111001111110101101101011110 98633f3f3f3f3fea973f3f5b98633f3f3f3f3fea973f3f5b5e
EUC-JP 歪?????齬??[歪?????齬??[^ 11001111110001000011111100111111001111110011111100111111111100111111011100111111001111110101101111001111110001000011111100111111001111110011111100111111111100111111011100111111001111110101101101011110 cfc43f3f3f3f3ff3f73f3f5bcfc43f3f3f3f3ff3f73f3f5b5e
UTF-8 歪귝깄旅앶굚齬끾줉[歪귝깄旅앶굚齬끾줉[^ 111001101010110110101010111010101011011110011101111010101011100110000100111011111010011010000011111011001001010110110110111010101011010110011010111010011011110110101100111010111000000110111110111011001010010010001001010110111110011010101101101010101110101010110111100111011110101010111001100001001110111110100110100000111110110010010101101101101110101010110101100110101110100110111101101011001110101110000001101111101110110010100100100010010101101101011110 e6adaaeab79deab984efa683ec95b6eab59ae9bdaceb81beeca4895be6adaaeab79deab984efa683ec95b6eab59ae9bdaceb81beeca4895b5e
UHC 歪귝깄旅앶굚齬끾줉[歪귝깄旅앶굚齬끾줉[^ 111010001110000010000010111001101000001110000101111001101010000110011101111010011000001010000010111001011110000110000101111001101010000110011101010110111110100011100000100000101110011010000011100001011110011010100001100111011110100110000010100000101110010111100001100001011110011010100001100111010101101101011110 e8e082e68385e6a19de98282e5e185e6a19d5be8e082e68385e6a19de98282e5e185e6a19d5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)