What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 韜穂クマv韜穂クマvB 1110100011101001100101011110010010111000100000110111110101110110111010001110100110010101111001001011100010000011011111010111011001000010 e8e995e4b8837d76e8e995e4b8837d7642
EUC-JP 韜穂クマv韜穂クマvB 11110000111010111100101011100110100011101011100010100101110111100111011011110000111010111100101011100110100011101011100010100101110111100111011001000010 f0ebcae68eb8a5de76f0ebcae68eb8a5de7642
UTF-8 韜穂クマv韜穂クマvB 111010011001111110011100111001111010100110000010111011111011110110111000111000111000001110011110011101101110100110011111100111001110011110101001100000101110111110111101101110001110001110000011100111100111011001000010 e99f9ce7a982efbdb8e3839e76e99f9ce7a982efbdb8e3839e7642
UHC 韜??マv韜??マvB 110101001011011100111111001111111010101111011110011101101101010010110111001111110011111110101011110111100111011001000010 d4b73f3fabde76d4b73f3fabde7642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
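
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_rows` and the `CHARSETS` mapping are hypothetical, and the choice of Python codec for each label (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are encoded with `errors="replace"`, which yields `?` (0x3f); this appears to match the `3f` bytes in the ISO-8859-1 and UHC rows above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# The codec chosen for each label is an assumption, not taken from this page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("マv"):
        print(label, bits, hex_string)
```

Calling `encode_rows` with a short string returns one row per charset, in the same order as the table. Results for the legacy charsets can differ between implementations, since vendors' mapping tables for them do not always agree.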