To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????BN}????????BN{^ 0011111100111111001111110011111100111111001111110011111100111111010000100100111001111101001111110011111100111111001111110011111100111111001111110011111101000010010011100111101101011110 3f3f3f3f3f3f3f3f424e7d3f3f3f3f3f3f3f3f424e7b5e
SJIS-WIN 鳥???栓賂??BN}鳥???栓賂??BN{^ 1001001010111001001111110011111100111111100100001111000010011000010001110011111100111111010000100100111001111101100100101011100100111111001111110011111110010000111100001001100001000111001111110011111101000010010011100111101101011110 92b93f3f3f90f098473f3f424e7d92b93f3f3f90f098473f3f424e7b5e
EUC-JP 鳥???栓賂??BN}鳥???栓賂??BN{^ 1100010010111011001111110011111100111111110000001111001011001111101010000011111100111111010000100100111001111101110001001011101100111111001111110011111111000000111100101100111110101000001111110011111101000010010011100111101101011110 c4bb3f3f3fc0f2cfa83f3f424e7dc4bb3f3f3fc0f2cfa83f3f424e7b5e
UTF-8 鳥희렰렠栓賂렰렣BN}鳥희렰렠栓賂렰렣BN{^ 11101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100110101000001001001111101000101100111000001011101011101000001011000011101011101000001010001101000010010011100111110111101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100110101000001001001111101000101100111000001011101011101000001011000011101011101000001010001101000010010011100111101101011110 e9b3a5ed9daceba0b0eba0a0e6a093e8b382eba0b0eba0a3424e7de9b3a5ed9daceba0b0eba0a0e6a093e8b382eba0b0eba0a3424e7b5e
UHC 鳥희렰렠栓賂렰렣BN}鳥희렰렠栓賂렰렣BN{^ 111100001110100011001000111100011000111010111101100011101011000111101110111110111101011011110001100011101011110110001110101101000100001001001110011111011111000011101000110010001111000110001110101111011000111010110001111011101111101111010110111100011000111010111101100011101011010001000010010011100111101101011110 f0e8c8f18ebd8eb1eefbd6f18ebd8eb4424e7df0e8c8f18ebd8eb1eefbd6f18ebd8eb4424e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)