To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN 篠テヲ篠ツト湿篠ツト宍篠テゥ篠ツト軸篠ツト自\ 100011101100001011000011101001101000111011000010110000101100010010001110101111001000111011000010110000101100010010001110101100111000111011000010110000111010100110001110110000101100001011000100100011101011001010001110110000101100001011000100100011101010100101011100 8ec2c3a68ec2c2c48ebc8ec2c2c48eb38ec2c3a98ec2c2c48eb28ec2c2c48ea95c
EUC-JP 篠テヲ篠ツト湿篠ツト宍篠テゥ篠ツト軸篠ツト自\ 101111001100010010001110110000111000111010100110101111001100010010001110110000101000111011000100101111001011111010111100110001001000111011000010100011101100010010111100101101011011110011000100100011101100001110001110101010011011110011000100100011101100001010001110110001001011110010110100101111001100010010001110110000101000111011000100101111001010101101011100 bcc48ec38ea6bcc48ec28ec4bcbebcc48ec28ec4bcb5bcc48ec38ea9bcc48ec28ec4bcb4bcc48ec28ec4bcab5c
UTF-8 篠テヲ篠ツト湿篠ツト宍篠テゥ篠ツト軸篠ツト自\ 11100111101011111010000011101111101111101000001111101111101111011010011011100111101011111010000011101111101111101000001011101111101111101000010011100110101110011011111111100111101011111010000011101111101111101000001011101111101111101000010011100101101011101000110111100111101011111010000011101111101111101000001111101111101111011010100111100111101011111010000011101111101111101000001011101111101111101000010011101000101110111011100011100111101011111010000011101111101111101000001011101111101111101000010011101000100001111010101001011100 e7afa0efbe83efbda6e7afa0efbe82efbe84e6b9bfe7afa0efbe82efbe84e5ae8de7afa0efbe83efbda9e7afa0efbe82efbe84e8bbb8e7afa0efbe82efbe84e887aa5c
UHC 篠??篠???篠???篠??篠??軸篠??自\ 11100001110001100011111100111111111000011100011000111111001111110011111111100001110001100011111100111111001111111110000111000110001111110011111111100001110001100011111100111111111101011110111011100001110001100011111100111111111011011011101101011100 e1c63f3fe1c63f3f3fe1c63f3f3fe1c63f3fe1c63f3ff5eee1c63f3fedbb5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)