What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 偲室篠蒔篠シト失偲悉偲タトシト痔偲悉偲ートシト叱B 100011101100001110001110101110101000111011000010100011101010101010001110110000101011110011000100100011101011100010001110110000111000111010111011100011101100001111000000110001001011110011000100100011101010010010001110110000111000111010111011100011101100001110110000110001001011110011000100100011101011011001000010 8ec38eba8ec28eaa8ec2bcc48eb88ec38ebb8ec3c0c4bcc48ea48ec38ebb8ec3b0c4bcc48eb642
EUC-JP 偲室篠蒔篠シト失偲悉偲タトシト痔偲悉偲ートシト叱B 10111100110001011011110010111100101111001100010010111100101011001011110011000100100011101011110010001110110001001011110010111010101111001100010110111100101111011011110011000101100011101100000010001110110001001000111010111100100011101100010010111100101001101011110011000101101111001011110110111100110001011000111010110000100011101100010010001110101111001000111011000100101111001011100001000010 bcc5bcbcbcc4bcacbcc48ebc8ec4bcbabcc5bcbdbcc58ec08ec48ebc8ec4bca6bcc5bcbdbcc58eb08ec48ebc8ec4bcb842
UTF-8 偲室篠蒔篠シト失偲悉偲タトシト痔偲悉偲ートシト叱B 11100101100000011011001011100101101011101010010011100111101011111010000011101000100100101001010011100111101011111010000011101111101111011011110011101111101111101000010011100101101001001011000111100101100000011011001011100110100000101000100111100101100000011011001011101111101111101000000011101111101111101000010011101111101111011011110011101111101111101000010011100111100101111001010011100101100000011011001011100110100000101000100111100101100000011011001011101111101111011011000011101111101111101000010011101111101111011011110011101111101111101000010011100101100011111011000101000010 e581b2e5aea4e7afa0e89294e7afa0efbdbcefbe84e5a4b1e581b2e68289e581b2efbe80efbe84efbdbcefbe84e79794e581b2e68289e581b2efbdb0efbe84efbdbcefbe84e58fb142
UHC ?室篠蒔篠??失?悉?????痔?悉?????叱B 00111111111000111111100011100001110001101110001111001000111000011100011000111111001111111110001111110111001111111110001111111010001111110011111100111111001111110011111111110110110000000011111111100011111110100011111100111111001111110011111100111111111100101110101001000010 3fe3f8e1c6e3c8e1c63f3fe3f73fe3fa3f3f3f3f3ff6c03fe3fa3f3f3f3f3ff2ea42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
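
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The codec names on the right-hand side of `CHARSETS` (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions about which Python codecs correspond to the charsets listed, and `encode_table` is a hypothetical helper name. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Render each byte as eight binary digits, concatenated without spaces.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("室B"):
        print(*row)
```

For the input `室B`, the UTF-8 row comes out as `e5aea442` and the ISO-8859-1 row as `3f42`, consistent with the corresponding bytes in the table above.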