To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character(s) Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????BN}????????BN{^ 0011111100111111001111110011111100111111001111110011111100111111010000100100111001111101001111110011111100111111001111110011111100111111001111110011111101000010010011100111101101011110 3f3f3f3f3f3f3f3f424e7d3f3f3f3f3f3f3f3f424e7b5e
SJIS-WIN 鳥???畢賂??BN}鳥???畢賂??BN{^ 1001001010111001001111110011111100111111100101010100110010011000010001110011111100111111010000100100111001111101100100101011100100111111001111110011111110010101010011001001100001000111001111110011111101000010010011100111101101011110 92b93f3f3f954c98473f3f424e7d92b93f3f3f954c98473f3f424e7b5e
EUC-JP 鳥???畢賂??BN}鳥???畢賂??BN{^ 1100010010111011001111110011111100111111110010011010110111001111101010000011111100111111010000100100111001111101110001001011101100111111001111110011111111001001101011011100111110101000001111110011111101000010010011100111101101011110 c4bb3f3f3fc9adcfa83f3f424e7dc4bb3f3f3fc9adcfa83f3f424e7b5e
UTF-8 鳥희렰렠畢賂렰렣BN}鳥희렰렠畢賂렰렣BN{^ 11101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100111100101011010001011101000101100111000001011101011101000001011000011101011101000001010001101000010010011100111110111101001101100111010010111101101100111011010110011101011101000001011000011101011101000001010000011100111100101011010001011101000101100111000001011101011101000001011000011101011101000001010001101000010010011100111101101011110 e9b3a5ed9daceba0b0eba0a0e795a2e8b382eba0b0eba0a3424e7de9b3a5ed9daceba0b0eba0a0e795a2e8b382eba0b0eba0a3424e7b5e
UHC 鳥희렰렠畢賂렰렣BN}鳥희렰렠畢賂렰렣BN{^ 111100001110100011001000111100011000111010111101100011101011000111111001101101001101011011110001100011101011110110001110101101000100001001001110011111011111000011101000110010001111000110001110101111011000111010110001111110011011010011010110111100011000111010111101100011101011010001000010010011100111101101011110 f0e8c8f18ebd8eb1f9b4d6f18ebd8eb4424e7df0e8c8f18ebd8eb1f9b4d6f18ebd8eb4424e7b5e
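The rows above can be reproduced roughly with the following Python sketch. It is only an illustration, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `to_bits` and `encode_rows` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as 8 binary digits, concatenated without separators."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_rows(text: str, codecs: dict = CODECS) -> list:
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in codecs.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip in that charset.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, to_bits(raw), raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_rows("鳥BN}"):
        print(label, shown, bits, hexa)
```

Calling `encode_rows` with a different string produces one row per charset in the same column order as the table: charset, characters, binary, hexadecimal.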

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).