To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 歪?????齬??[歪?????齬??[^ 10011000011000110011111100111111001111110011111100111111111010101001011100111111001111110101101110011000011000110011111100111111001111110011111100111111111010101001011100111111001111110101101101011110 98633f3f3f3f3fea973f3f5b98633f3f3f3f3fea973f3f5b5e
EUC-JP 歪?????齬??[歪?????齬??[^ 11001111110001000011111100111111001111110011111100111111111100111111011100111111001111110101101111001111110001000011111100111111001111110011111100111111111100111111011100111111001111110101101101011110 cfc43f3f3f3f3ff3f73f3f5bcfc43f3f3f3f3ff3f73f3f5b5e
UTF-8 歪귝깄嶺믧굚齬끿춶[歪귝깄嶺믧굚齬끿춶[^ 111001101010110110101010111010101011011110011101111010101011100110000100111011111010011010101011111010111010111110100111111010101011010110011010111010011011110110101100111010111000000110111111111011001011011010110110010110111110011010101101101010101110101010110111100111011110101010111001100001001110111110100110101010111110101110101111101001111110101010110101100110101110100110111101101011001110101110000001101111111110110010110110101101100101101101011110 e6adaaeab79deab984efa6abebafa7eab59ae9bdaceb81bfecb6b65be6adaaeab79deab984efa6abebafa7eab59ae9bdaceb81bfecb6b65b5e
UHC 歪귝깄嶺믧굚齬끿춶[歪귝깄嶺믧굚齬끿춶[^ 111010001110000010000010111001101000001110000101111001111010110110010010111010011000001010000010111001011110000110000101111001111010110110010010010110111110100011100000100000101110011010000011100001011110011110101101100100101110100110000010100000101110010111100001100001011110011110101101100100100101101101011110 e8e082e68385e7ad92e98282e5e185e7ad925be8e082e68385e7ad92e98282e5e185e7ad925b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)