To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????BF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN ????????????????????BF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
EUC-JP ????????????????????BF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
UTF-8 챵짰혦짙챗쩌혦짯챤챔째챌혦짱횖챰혦쩍혦짖BF 1110110010110001101101011110110010100111101100001110110110011000101001101110110010100111100110011110110010110001100101111110110010101001100011001110110110011000101001101110110010100111101011111110110010110001101001001110110010110001100101001110110010100111101110001110110010110001100011001110110110011000101001101110110010100111101100011110110110011010100101101110110010110001101100001110110110011000101001101110110010101001100011011110110110011000101001101110110010100111100101100100001001000110 ecb1b5eca7b0ed98a6eca799ecb197eca98ced98a6eca7afecb1a4ecb194eca7b8ecb18ced98a6eca7b1ed9a96ecb1b0ed98a6eca98ded98a6eca7964246
UHC 챵짰혦짙챗쩌혦짯챤챔째챌혦짱횖챰혦쩍혦짖BF 110000111011001011000010101011101100001010001110110000101010001111000011101010101100001010111100110000101000111011000010101011011100001110101110110000111010100011000010101100001100001110100111110000101000111011000010101011111100001110010000110000111011000111000010100011101100001010111101110000101000111011000010101000100100001001000110 c3b2c2aec28ec2a3c3aac2bcc28ec2adc3aec3a8c2b0c3a7c28ec2afc390c3b1c28ec2bdc28ec2a24246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)