To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?????耳??矣??瑤??B 1110000110011111001111110011111100111111001111110011111110001110101010000011111100111111111000011110000100111111001111111110101010100010001111110011111101000010 e19f3f3f3f3f3f8ea83f3fe1e13f3feaa23f3f42
EUC-JP 癲?????耳??矣??瑤??B 1110001010100001001111110011111100111111001111110011111110111100101010100011111100111111111000101110001100111111001111111111010010100100001111110011111101000010 e2a13f3f3f3f3fbcaa3f3fe2e33f3ff4a43f3f42
UTF-8 癲숈슜柳뗩윹耳뀐쬂矣⑸솇瑤녹섣B 11100111100110011011001011101100100010001000100011101100100010101001110011101111101001111000100111101011100101111010100111101100100111001011100111101000100000001011001111101011100000001001000011101100101011001000001011100111100111111010001111100010100100011011100011101100100001101000011111100111100100011010010011101011100001011011100111101100100001001010001101000010 e799b2ec8888ec8a9cefa789eb97a9ec9cb9e880b3eb8090ecac82e79fa3e291b8ec8687e791a4eb85b9ec84a342
UHC 癲숈슜柳뗩윹耳뀐쬂矣⑸솇瑤녹섣B 11101111101001101001100111101100100110101010100111101010111101111000101111101001100111111011001111101100101111001011001011101111101001101001100111101011111110001010100111101011100110011000101111101000111111011011001111101100101111001011001001000010 efa699ec9aa9eaf78be99fb3ecbcb2efa699ebf8a9eb998be8fdb3ecbcb242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)