To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鵝????∀庸?????儀??閻i?懿??B 1110101001000000001111110011111100111111001111111000000111001101100101110110011000111111001111110011111100111111001111111000101101010110001111110011111111101000100001011000001010001001001111111001110011110010001111110011111101000010 ea403f3f3f3f81cd97663f3f3f3f3f8b563f3fe88582893f9cf23f3f42
EUC-JP 鵝??瑗?∀庸?????儀??閻i?懿??B 11110011101000010011111100111111100011111100110011000000001111111010001011001111110011011100011100111111001111110011111100111111001111111011010110110111001111110011111111101111111001011010001111101001001111111101100011110100001111110011111101000010 f3a13f3f8fccc03fa2cfcdc73f3f3f3f3fb5b73f3fefe5a3e93fd8f43f3f42
UTF-8 鵝숈뮄瑗띸∀庸뉗빓留곦퐰儀볤펾閻i펶懿룸쭅B 11101001101101011001110111101100100010001000100011101011101011101000010011100111100100011001011111101011100111011011100011100010100010001000000011100101101110101011100011101011100010011001011111101011101110011001001111101111101001111000110111101010101100111010011011101101100100001011000011100101100001001000000011101011101100111010010011101101100011101011111011101001100101101011101111101111101111011000100111101101100011101011011011100110100001111011111111101011101000111011100011101100101011011000010101000010 e9b59dec8888ebae84e79197eb9db8e28880e5bab8eb8997ebb993efa78deab3a6ed90b0e58480ebb3a4ed8ebee996bbefbd89ed8eb6e687bfeba3b8ecad8542
UHC 鵝숈뮄瑗띸∀庸뉗빓留곦퐰儀볤펾閻i펶懿룸쭅B 11100100101111011001100111101100100100101001001111101010101111001000110111100111101000101010001111101001101111001000011111101100100101011011011111101011101001111000000111100100101111011001100111101011111100001001001111101010101111001000110111100111101000101010001111101001101111001000011111101011111100111011011111101011101001111000000101000010 e4bd99ec9293eabc8de7a2a3e9bc87ec95b7eba781e4bd99ebf093eabc8de7a2a3e9bc87ebf3b7eba78142

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
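
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (SJIS-WIN as `cp932`, UHC as `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table does as well.

```python
# Charset labels from the table, mapped to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the letter "B" every charset in the list yields 01000010 (hex 42), matching the last byte of each row in the table. A multi-byte character such as "∀" gives different bytes per charset, for example e28880 in UTF-8 and 81cd in CP932.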