To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN 小???舌堯小?縟?舌邵小?縟?舌妖}B 10001111101011000011111100111111001111111001000011100011111010101001111110001111101011000011111111100011011101000011111110010000111000111110011110111000100011111010110000111111111000110111010000111111100100001110001110010111011001000111110101000010 8fac3f3f3f90e3ea9f8fac3fe3743f90e3e7b88fac3fe3743f90e397647d42
EUC-JP 小??炤舌堯小?縟炤舌邵小?縟炤舌妖}B 10111110101011100011111100111111100011111100100111010010110000001110010111110100101000011011111010101110001111111110010111010101100011111100100111010010110000001110010111101110101110101011111010101110001111111110010111010101100011111100100111010010110000001110010111001101110001010111110101000010 beae3f3f8fc9d2c0e5f4a1beae3fe5d58fc9d2c0e5eebabeae3fe5d58fc9d2c0e5cdc57d42
UTF-8 小숞蟬炤舌堯小숞縟炤舌邵小숞縟炤舌妖}B 1110010110110000100011111110110010001000100111101110100010011111101011001110011110000010101001001110100010001000100011001110010110100000101011111110010110110000100011111110110010001000100111101110011110111000100111111110011110000010101001001110100010001000100011001110100110000010101101011110010110110000100011111110110010001000100111101110011110111000100111111110011110000010101001001110100010001000100011001110010110100110100101100111110101000010 e5b08fec889ee89face782a4e8888ce5a0afe5b08fec889ee7b89fe782a4e8888ce982b5e5b08fec889ee7b89fe782a4e8888ce5a6967d42
UHC 小숞蟬炤舌堯小숞縟炤舌邵小숞縟炤舌妖}B 1110000110110011100110011111101111100000110100011110000110111111111000001101111111101000111010111110000110110011100110011111101111101001101100101110000110111111111000001101111111100001110100001110000110110011100110011111101111101001101100101110000110111111111000001101111111101000111011010111110101000010 e1b399fbe0d1e1bfe0dfe8ebe1b399fbe9b2e1bfe0dfe1d0e1b399fbe9b2e1bfe0dfe8ed7d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)