To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷?????猷??循?+肄??猷ロ?循 100101110101000100111111001111110011111100111111001111111001011101010001001111110011111110001111011110100011111110000001011110111110001111100101001111110011111110010111010100011000001110001101001111111000111101111010 97513f3f3f3f3f97513f3f8f7a3f817be3e53f3f9751838d3f8f7a
EUC-JP 猷?????猷??循?+肄??猷ロ?循 110011011011001000111111001111110011111100111111001111111100110110110010001111110011111110111101110110110011111110100001110111001110011011100111001111110011111111001101101100101010010111101101001111111011110111011011 cdb23f3f3f3f3fcdb23f3fbddb3fa1dce6e73f3fcdb2a5ed3fbddb
UTF-8 猷띕뇾輦됤닽猷띤뼳循뀀+肄볝궞猷ロ쐛循 111001111000110010110111111010111001110110010101111010111000011110111110111011111010011010011000111010111001000010100100111010111000101110111101111001111000110010110111111010111001110110100100111010111011110010110011111001011011111010101010111010111000000010000000111011111011110010001011111010001000001010000100111010111011001110011101111010101011011010011110111001111000110010110111111000111000001110101101111011001001000010011011111001011011111010101010 e78cb7eb9d95eb87beefa698eb90a4eb8bbde78cb7eb9da4ebbcb3e5beaaeb8080efbc8be88284ebb39deab69ee78cb7e383adec909be5beaa
UHC 猷띕뇾輦됤닽猷띤뼳循뀀+肄볝궞猷ロ쐛循 1110101110100011101101101110101110000111100111111110011011100100100010011110001010001000101010111110101110100011101101101110110110010110101101101110001011100000101100101110101110100011101010111110110010111101100100111110001110000010101100011110101110100011101010111110110110011100100000011110001011100000 eba3b6eb879fe6e489e288abeba3b6ed96b6e2e0b2eba3abecbd93e382b1eba3abed9c81e2e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)