To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 叱鱸自v叱鱸自vB 11110001111110101000111010110110111010011110011110001110101010010111011011110001111110101000111010110110111010011110011110001110101010010111011001000010 f1fa8eb6e9e78ea976f1fa8eb6e9e78ea97642
EUC-JP ?叱鱸自v?叱鱸自vB 0011111110111100101110001111001011101001101111001010101101110110001111111011110010111000111100101110100110111100101010110111011001000010 3fbcb8f2e9bcab763fbcb8f2e9bcab7642
UTF-8 叱鱸自v叱鱸自vB 111011101000010110110101111001011000111110110001111010011011000110111000111010001000011110101010011101101110111010000101101101011110010110001111101100011110100110110001101110001110100010000111101010100111011001000010 ee85b5e58fb1e9b1b8e887aa76ee85b5e58fb1e9b1b8e887aa7642
UHC ?叱?自v?叱?自vB 001111111111001011101010001111111110110110111011011101100011111111110010111010100011111111101101101110110111011001000010 3ff2ea3fedbb763ff2ea3fedbb7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)