To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 小???舌泄小???舌瀟小???舌瀟B 10001111101011000011111100111111001111111001000011100011100111111001010110001111101011000011111100111111001111111001000011100011111000000110111010001111101011000011111100111111001111111001000011100011111000000110111001000010 8fac3f3f3f90e39f958fac3f3f3f90e3e06e8fac3f3f3f90e3e06e42
EUC-JP 小??炤舌泄小??炤舌瀟小??炤舌瀟B 10111110101011100011111100111111100011111100100111010010110000001110010111011101111101011011111010101110001111110011111110001111110010011101001011000000111001011101111111001111101111101010111000111111001111111000111111001001110100101100000011100101110111111100111101000010 beae3f3f8fc9d2c0e5ddf5beae3f3f8fc9d2c0e5dfcfbeae3f3f8fc9d2c0e5dfcf42
UTF-8 小숞蟬炤舌泄小숞蟬炤舌瀟小숞蟬炤舌瀟B 11100101101100001000111111101100100010001001111011101000100111111010110011100111100000101010010011101000100010001000110011100110101100111000010011100101101100001000111111101100100010001001111011101000100111111010110011100111100000101010010011101000100010001000110011100111100000001001111111100101101100001000111111101100100010001001111011101000100111111010110011100111100000101010010011101000100010001000110011100111100000001001111101000010 e5b08fec889ee89face782a4e8888ce6b384e5b08fec889ee89face782a4e8888ce7809fe5b08fec889ee89face782a4e8888ce7809f42
UHC 小숞蟬炤舌泄小숞蟬炤舌瀟小숞蟬炤舌瀟B 11100001101100111001100111111011111000001101000111100001101111111110000011011111111000001101110011100001101100111001100111111011111000001101000111100001101111111110000011011111111000011011111011100001101100111001100111111011111000001101000111100001101111111110000011011111111000011011111001000010 e1b399fbe0d1e1bfe0dfe0dce1b399fbe0d1e1bfe0dfe1bee1b399fbe0d1e1bfe0dfe1be42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)