What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 赦梢フv赦梢フvB 1000111011001101100011111011110111110000100011101100110001110110100011101100110110001111101111011111000010001110110011000111011001000010 8ecd8fbdf08ecc768ecd8fbdf08ecc7642
EUC-JP 赦梢?フv赦梢?フvB 1011110011001111101111101011111100111111100011101100110001110110101111001100111110111110101111110011111110001110110011000111011001000010 bccfbebf3f8ecc76bccfbebf3f8ecc7642
UTF-8 赦梢フv赦梢フvB 111010001011010110100110111001101010001010100010111011101000000110001101111011111011111010001100011101101110100010110101101001101110011010100010101000101110111010000001100011011110111110111110100011000111011001000010 e8b5a6e6a2a2ee818defbe8c76e8b5a6e6a2a2ee818defbe8c7642
UHC 赦梢??v赦梢??vB 110111101111010111110100111111100011111100111111011101101101111011110101111101001111111000111111001111110111011001000010 def5f4fe3f3f76def5f4fe3f3f7642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
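
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The `CODECS` mapping and the helper names `to_bits` and `encode_row` are hypothetical. The sketch also assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to the SJIS-WIN, EUC-JP, and UHC rows. Characters a charset cannot represent are replaced with `?` (0x3f), as in the table. Results for private-use characters may differ from the page.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",    # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",         # assumption: UHC corresponds to CP949
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, most significant bit first."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str]:
    """Encode text in the given codec and return (binary string, hex string).

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    return to_bits(data), data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("vB", codec)
        print(label, bits, hex_string)
```

For example, `encode_row("vB", "utf-8")` returns `("0111011001000010", "7642")`, which matches the trailing bytes of every row in the table.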