Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鳥???終???????鳥???終???????^ 1001001010111001001111110011111100111111100011110100100100111111001111110011111100111111001111110011111100111111100100101011100100111111001111110011111110001111010010010011111100111111001111110011111100111111001111110011111101011110 92b93f3f3f8f493f3f3f3f3f3f3f92b93f3f3f8f493f3f3f3f3f3f3f5e
EUC-JP 鳥???終???????鳥???終???????^ 1100010010111011001111110011111100111111101111011010101000111111001111110011111100111111001111110011111100111111110001001011101100111111001111110011111110111101101010100011111100111111001111110011111100111111001111110011111101011110 c4bb3f3f3fbdaa3f3f3f3f3f3f3fc4bb3f3f3fbdaa3f3f3f3f3f3f3f5e
UTF-8 鳥희렰렠終꿸렣띳렰렲吏넸鳥희렰렠終꿸렣띳렰렲吏넵^ 11101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100111101101011000001011101010101111111011100011101011101000001010001111101011100111011011001111101011101000001011000011101011101000001011001011101111101001111001111011101011100001001011100011101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100111101101011000001011101010101111111011100011101011101000001010001111101011100111011011001111101011101000001011000011101011101000001011001011101111101001111001111011101011100001001011010101011110 e9b3a5ed9daceba0b0eba0a0e7b582eabfb8eba0a3eb9db3eba0b0eba0b2efa79eeb84b8e9b3a5ed9daceba0b0eba0a0e7b582eabfb8eba0a3eb9db3eba0b0eba0b2efa79eeb84b55e
UHC 鳥희렰렠終꿸렣띳렰렲吏넸鳥희렰렠終꿸렣띳렰렲吏넵^ 11110000111010001100100011110001100011101011110110001110101100011111000011111011101100101110101010001110101101001011011011110001100011101011110110001110101111111110110010100111101100111101111011110000111010001100100011110001100011101011110110001110101100011111000011111011101100101110101010001110101101001011011011110001100011101011110110001110101111111110110010100111101100111101110001011110 f0e8c8f18ebd8eb1f0fbb2ea8eb4b6f18ebd8ebfeca7b3def0e8c8f18ebd8eb1f0fbb2ea8eb4b6f18ebd8ebfeca7b3dc5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
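
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes the tool's charset labels map to Python's built-in codecs as listed in `CHARSETS` (for example SJIS-WIN to `cp932` and UHC to `cp949`), and that characters a charset cannot represent are replaced with `?` (0x3f), as the table rows suggest. The function name `encode_table` is hypothetical.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("鳥^"):
        print(label, bits, hex_string)
```

For the input `鳥^`, this yields `92b95e` for SJIS-WIN, `c4bb5e` for EUC-JP, and `e9b3a55e` for UTF-8, which agrees with the leading and trailing bytes of the corresponding rows above. ISO-8859-1 cannot represent `鳥`, so it yields `3f5e`.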