To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 盖喊竟盖俾髴眩瘁瘁眩瘁俾盖喊韈眩瘁俾 111000011011001110011010010111101110100011101101111000011011001110011000111010101110100110011100111000011011111111100001100000011110000110000001111000011011111111100001100000011001100011101010111000011011001110011010010111101110100011100111111000011011111111100001100000011001100011101010 e1b39a5ee8ede1b398eae99ce1bfe181e181e1bfe18198eae1b39a5ee8e7e1bfe18198ea
EUC-JP 盖喊竟盖俾髴眩瘁瘁眩瘁俾盖喊韈眩瘁俾 111000101011010111010011101111111111000011101111111000101011010111010000111011001111000111111100111000101100000111100001111000011110000111100001111000101100000111100001111000011101000011101100111000101011010111010011101111111111000011101001111000101100000111100001111000011101000011101100 e2b5d3bff0efe2b5d0ecf1fce2c1e1e1e1e1e2c1e1e1d0ece2b5d3bff0e9e2c1e1e1d0ec
UTF-8 盖喊竟盖俾髴眩瘁瘁眩瘁俾盖喊韈眩瘁俾 111001111001101110010110111001011001011010001010111001111010101110011111111001111001101110010110111001001011111110111110111010011010101110110100111001111001110010101001111001111001100010000001111001111001100010000001111001111001110010101001111001111001100010000001111001001011111110111110111001111001101110010110111001011001011010001010111010011001111110001000111001111001110010101001111001111001100010000001111001001011111110111110 e79b96e5968ae7ab9fe79b96e4bfbee9abb4e79ca9e79881e79881e79ca9e79881e4bfbee79b96e5968ae99f88e79ca9e79881e4bfbe
UHC 盖喊竟盖??眩??眩??盖喊?眩?? 110010111100110011111001111000101100110011100101110010111100110000111111001111111111101011011111001111110011111111111010110111110011111100111111110010111100110011111001111000100011111111111010110111110011111100111111 cbccf9e2cce5cbcc3f3ffadf3f3ffadf3f3fcbccf9e23ffadf3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)