To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 盖哈疽眩瘁齏盖哈鶺眩瘁疽盖哈疽眩瘁齏 111000011011001110011001111110111110000101110011111000011011111111100001100000011110100011101011111000011011001110011001111110111110101001010100111000011011111111100001100000011110000101110011111000011011001110011001111110111110000101110011111000011011111111100001100000011110100011101011 e1b399fbe173e1bfe181e8ebe1b399fbea54e1bfe181e173e1b399fbe173e1bfe181e8eb
EUC-JP 盖哈疽眩瘁齏盖哈鶺眩瘁疽盖哈疽眩瘁齏 111000101011010111010010111111011110000111010100111000101100000111100001111000011111000011101101111000101011010111010010111111011111001110110101111000101100000111100001111000011110000111010100111000101011010111010010111111011110000111010100111000101100000111100001111000011111000011101101 e2b5d2fde1d4e2c1e1e1f0ede2b5d2fdf3b5e2c1e1e1e1d4e2b5d2fde1d4e2c1e1e1f0ed
UTF-8 盖哈疽眩瘁齏盖哈鶺眩瘁疽盖哈疽眩瘁齏 111001111001101110010110111001011001001110001000111001111001011010111101111001111001110010101001111001111001100010000001111010011011110110001111111001111001101110010110111001011001001110001000111010011011011010111010111001111001110010101001111001111001100010000001111001111001011010111101111001111001101110010110111001011001001110001000111001111001011010111101111001111001110010101001111001111001100010000001111010011011110110001111 e79b96e59388e796bde79ca9e79881e9bd8fe79b96e59388e9b6bae79ca9e79881e796bde79b96e59388e796bde79ca9e79881e9bd8f
UHC 盖哈疽眩??盖哈?眩?疽盖哈疽眩?? 110010111100110011111001111010111110111011000101111110101101111100111111001111111100101111001100111110011110101100111111111110101101111100111111111011101100010111001011110011001111100111101011111011101100010111111010110111110011111100111111 cbccf9ebeec5fadf3f3fcbccf9eb3ffadf3feec5cbccf9ebeec5fadf3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)