To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 鉛??鳶?????[鉛??鳶?????[^ 10001001100101000011111100111111100100111100111000111111001111110011111100111111001111110101101110001001100101000011111100111111100100111100111000111111001111110011111100111111001111110101101101011110 89943f3f93ce3f3f3f3f3f5b89943f3f93ce3f3f3f3f3f5b5e
EUC-JP 鉛??鳶?????[鉛??鳶?????[^ 10110001111101000011111100111111110001101101000000111111001111110011111100111111001111110101101110110001111101000011111100111111110001101101000000111111001111110011111100111111001111110101101101011110 b1f43f3fc6d03f3f3f3f3f5bb1f43f3fc6d03f3f3f3f3f5b5e
UTF-8 鉛당뙥鳶껆돱列⑴땭[鉛당뙥鳶껆돱列⑴땭[^ 111010011000100110011011111010111000101110111001111010111001100110100101111010011011001110110110111010101011101110000110111010111000111110110001111011111010011010011100111000101001000110110100111010111001010110101101010110111110100110001001100110111110101110001011101110011110101110011001101001011110100110110011101101101110101010111011100001101110101110001111101100011110111110100110100111001110001010010001101101001110101110010101101011010101101101011110 e9899beb8bb9eb99a5e9b3b6eabb86eb8fb1efa69ce291b4eb95ad5be9899beb8bb9eb99a5e9b3b6eabb86eb8fb1efa69ce291b4eb95ad5b5e
UHC 鉛당뙥鳶껆돱列⑴땭[鉛당뙥鳶껆돱列⑴땭[^ 111001101110011110110100111001111000110010101001111001101110100110000011111001111000100110110100111001101110101010101001111001111000101110000011010110111110011011100111101101001110011110001100101010011110011011101001100000111110011110001001101101001110011011101010101010011110011110001011100000110101101101011110 e6e7b4e78ca9e6e983e789b4e6eaa9e78b835be6e7b4e78ca9e6e983e789b4e6eaa9e78b835b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
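
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_row` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the ISO-8859-1 row above appears to behave.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (text as the charset can represent it, binary string, hex string)."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode the bytes again to show which characters survived the conversion.
    shown = raw.decode(codec, errors="replace")
    # Eight binary digits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "鉛"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(label, shown, bits, hexed)
```

For the single character 鉛, this yields `8994` under `cp932`, `b1f4` under `euc_jp`, and `e9899b` under `utf-8`, which agrees with the leading bytes of the corresponding rows in the table.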