To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 辿続其棚他造棚歎 10010010010010001001000110110001100100011011010010010010010010011001000110111100100100011010001010010010010010011001001001010110 924891b191b4924991bc91a292499256
EUC-JP 辿続其棚他造棚歎 11000011101010011100001010110011110000101011011011000011101010101100001010111110110000101010010011000011101010101100001110110111 c3a9c2b3c2b6c3aac2bec2a4c3aac3b7
UTF-8 辿続其棚他造棚歎 111010001011111010111111111001111011011010011010111001011000010110110110111001101010001110011010111001001011101110010110111010011000000010100000111001101010001110011010111001101010110110001110 e8bebfe7b69ae585b6e6a39ae4bb96e980a0e6a39ae6ad8e
UHC ??其棚他造棚歎 0011111100111111110100001110110011011101110111001111011011100010111100001110001111011101110111001111011110100111 3f3fd0ecdddcf6e2f0e3dddcf7a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)