To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???v???vB 001111110011111100111111011101100011111100111111001111110111011001000010 3f3f3f763f3f3f7642
SJIS-WIN シム赳vシム赳vB 1011110011010001111001101110000001110110101111001101000111100110111000000111011001000010 bcd1e6e076bcd1e6e07642
EUC-JP シム赳vシム赳vB 100011101011110010001110110100011110110011100010011101101000111010111100100011101101000111101100111000100111011001000010 8ebc8ed1ece2768ebc8ed1ece27642
UTF-8 シム赳vシム赳vB 111011111011110110111100111011111011111010010001111010001011010110110011011101101110111110111101101111001110111110111110100100011110100010110101101100110111011001000010 efbdbcefbe91e8b5b376efbdbcefbe91e8b5b37642
UHC ??赳v??赳vB 0011111100111111110100001010111101110110001111110011111111010000101011110111011001000010 3f3fd0af763f3fd0af7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)