To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 竪促探竪v竪促探竪vB 10010010010001111001000110100011100100100101010010010010010001110111011010010010010001111001000110100011100100100101010010010010010001110111011001000010 924791a39254924776924791a3925492477642
EUC-JP 竪促探竪v竪促探竪vB 11000011101010001100001010100101110000111011010111000011101010000111011011000011101010001100001010100101110000111011010111000011101010000111011001000010 c3a8c2a5c3b5c3a876c3a8c2a5c3b5c3a87642
UTF-8 竪促探竪v竪促探竪vB 111001111010101110101010111001001011111110000011111001101000111010100010111001111010101110101010011101101110011110101011101010101110010010111111100000111110011010001110101000101110011110101011101010100111011001000010 e7abaae4bf83e68ea2e7abaa76e7abaae4bf83e68ea2e7abaa7642
UHC 竪促探竪v竪促探竪vB 11100010101101011111010110110101111101111010111011100010101101010111011011100010101101011111010110110101111101111010111011100010101101010111011001000010 e2b5f5b5f7aee2b576e2b5f5b5f7aee2b57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)