What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鳥???趙???全?憶?鳥???趙???全?憶?^ 100100101011100100111111001111110011111111100110111000100011111100111111001111111001000101010011001111111000100110101111001111111001001010111001001111110011111100111111111001101110001000111111001111110011111110010001010100110011111110001001101011110011111101011110 92b93f3f3fe6e23f3f3f91533f89af3f92b93f3f3fe6e23f3f3f91533f89af3f5e
EUC-JP 鳥???趙???全?憶?鳥???趙???全?憶?^ 110001001011101100111111001111110011111111101100111001000011111100111111001111111100000110110100001111111011001010110001001111111100010010111011001111110011111100111111111011001110010000111111001111110011111111000001101101000011111110110010101100010011111101011110 c4bb3f3f3fece43f3f3fc1b43fb2b13fc4bb3f3f3fece43f3f3fc1b43fb2b13f5e
UTF-8 鳥희렰렠趙얹렰렚全렖憶쌨鳥희렰렠趙얹렰렚全렖憶쌤^ 11101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011101000101101101001100111101100100101101011100111101011101000001011000011101011101000001001101011100101100001011010100011101011101000001001011011100110100001101011011011101100100011001010100011101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011101000101101101001100111101100100101101011100111101011101000001011000011101011101000001001101011100101100001011010100011101011101000001001011011100110100001101011011011101100100011001010010001011110 e9b3a5ed9daceba0b0eba0a0e8b699ec96b9eba0b0eba09ae585a8eba096e686b6ec8ca8e9b3a5ed9daceba0b0eba0a0e8b699ec96b9eba0b0eba09ae585a8eba096e686b6ec8ca45e
UHC 鳥희렰렠趙얹렰렚全렖憶쌨鳥희렰렠趙얹렰렚全렖憶쌤^ 11110000111010001100100011110001100011101011110110001110101100011111000011100001101111101111000110001110101111011000111010101101111011101110111110001110101010111110010111100011101111011101111011110000111010001100100011110001100011101011110110001110101100011111000011100001101111101111000110001110101111011000111010101101111011101110111110001110101010111110010111100011101111011101110001011110 f0e8c8f18ebd8eb1f0e1bef18ebd8eadeeef8eabe5e3bddef0e8c8f18ebd8eb1f0e1bef18ebd8eadeeef8eabe5e3bddc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
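
The conversion shown in the table can be approximated with a minimal Python sketch. The mapping from the page's charset labels to Python codec names (for example UHC to `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration, not the tool's actual implementation. Characters a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the table, though the page itself may handle them differently.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return data.hex(), bits


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (hex_str, bits) in encode_table("鳥^").items():
        print(label, bits, hex_str)
```

For the input `鳥^`, the UTF-8 row comes out as `e9b3a5` followed by `5e`, matching the first and last bytes of the UTF-8 row in the table above.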