To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}v??????}vB 0011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f7d763f3f3f3f3f3f7d7642
SJIS-WIN 竪他俗誰遜則}v竪他俗誰遜則}vB 1001001001000111100100011011110010010001101011011001001001001110100100011011101110010001101001010111110101110110100100100100011110010001101111001001000110101101100100100100111010010001101110111001000110100101011111010111011001000010 924791bc91ad924e91bb91a57d76924791bc91ad924e91bb91a57d7642
EUC-JP 竪他俗誰遜則}v竪他俗誰遜則}vB 1100001110101000110000101011111011000010101011111100001110101111110000101011110111000010101001110111110101110110110000111010100011000010101111101100001010101111110000111010111111000010101111011100001010100111011111010111011001000010 c3a8c2bec2afc3afc2bdc2a77d76c3a8c2bec2afc3afc2bdc2a77d7642
UTF-8 竪他俗誰遜則}v竪他俗誰遜則}vB 1110011110101011101010101110010010111011100101101110010010111111100101111110100010101010101100001110100110000001100111001110010110001001100001110111110101110110111001111010101110101010111001001011101110010110111001001011111110010111111010001010101010110000111010011000000110011100111001011000100110000111011111010111011001000010 e7abaae4bb96e4bf97e8aab0e9819ce589877d76e7abaae4bb96e4bf97e8aab0e9819ce589877d7642
UHC 竪他俗誰遜則}v竪他俗誰遜則}vB 1110001010110101111101101110001011100001110101001110001011000001111000011110000111110110110011100111110101110110111000101011010111110110111000101110000111010100111000101100000111100001111000011111011011001110011111010111011001000010 e2b5f6e2e1d4e2c1e1e1f6ce7d76e2b5f6e2e1d4e2c1e1e1f6ce7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)