To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鳥???趙???全?億?鳥???趙???全?億?^ 100100101011100100111111001111110011111111100110111000100011111100111111001111111001000101010011001111111000100110101101001111111001001010111001001111110011111100111111111001101110001000111111001111110011111110010001010100110011111110001001101011010011111101011110 92b93f3f3fe6e23f3f3f91533f89ad3f92b93f3f3fe6e23f3f3f91533f89ad3f5e
EUC-JP 鳥???趙???全?億?鳥???趙???全?億?^ 110001001011101100111111001111110011111111101100111001000011111100111111001111111100000110110100001111111011001010101111001111111100010010111011001111110011111100111111111011001110010000111111001111110011111111000001101101000011111110110010101011110011111101011110 c4bb3f3f3fece43f3f3fc1b43fb2af3fc4bb3f3f3fece43f3f3fc1b43fb2af3f5e
UTF-8 鳥희렰렠趙얹렰렚全렖億렏鳥희렰렠趙얹렰렚全렖億렏^ 11101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011101000101101101001100111101100100101101011100111101011101000001011000011101011101000001001101011100101100001011010100011101011101000001001011011100101100001001000010011101011101000001000111111101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011101000101101101001100111101100100101101011100111101011101000001011000011101011101000001001101011100101100001011010100011101011101000001001011011100101100001001000010011101011101000001000111101011110 e9b3a5ed9daceba0b0eba0a0e8b699ec96b9eba0b0eba09ae585a8eba096e58484eba08fe9b3a5ed9daceba0b0eba0a0e8b699ec96b9eba0b0eba09ae585a8eba096e58484eba08f5e
UHC 鳥희렰렠趙얹렰렚全렖億렏鳥희렰렠趙얹렰렚全렖億렏^ 11110000111010001100100011110001100011101011110110001110101100011111000011100001101111101111000110001110101111011000111010101101111011101110111110001110101010111110010111100010100011101010010111110000111010001100100011110001100011101011110110001110101100011111000011100001101111101111000110001110101111011000111010101101111011101110111110001110101010111110010111100010100011101010010101011110 f0e8c8f18ebd8eb1f0e1bef18ebd8eadeeef8eabe5e28ea5f0e8c8f18ebd8eb1f0e1bef18ebd8eadeeef8eabe5e28ea55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)