To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????}v??????}vB 0011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f7d763f3f3f3f3f3f7d7642
SJIS-WIN 盖哈鶺眩瘁鶺}v盖哈鶺眩瘁鶺}vB 1110000110110011100110011111101111101010010101001110000110111111111000011000000111101010010101000111110101110110111000011011001110011001111110111110101001010100111000011011111111100001100000011110101001010100011111010111011001000010 e1b399fbea54e1bfe181ea547d76e1b399fbea54e1bfe181ea547d7642
EUC-JP 盖哈鶺眩瘁鶺}v盖哈鶺眩瘁鶺}vB 1110001010110101110100101111110111110011101101011110001011000001111000011110000111110011101101010111110101110110111000101011010111010010111111011111001110110101111000101100000111100001111000011111001110110101011111010111011001000010 e2b5d2fdf3b5e2c1e1e1f3b57d76e2b5d2fdf3b5e2c1e1e1f3b57d7642
UTF-8 盖哈鶺眩瘁鶺}v盖哈鶺眩瘁鶺}vB 1110011110011011100101101110010110010011100010001110100110110110101110101110011110011100101010011110011110011000100000011110100110110110101110100111110101110110111001111001101110010110111001011001001110001000111010011011011010111010111001111001110010101001111001111001100010000001111010011011011010111010011111010111011001000010 e79b96e59388e9b6bae79ca9e79881e9b6ba7d76e79b96e59388e9b6bae79ca9e79881e9b6ba7d7642
UHC 盖哈?眩??}v盖哈?眩??}vB 1100101111001100111110011110101100111111111110101101111100111111001111110111110101110110110010111100110011111001111010110011111111111010110111110011111100111111011111010111011001000010 cbccf9eb3ffadf3f3f7d76cbccf9eb3ffadf3f3f7d7642
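A table of this kind can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row above. Python's codec tables may differ from the page's in edge cases.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("}vB").items():
        print(label, bits, hex_string)
```

ASCII characters such as `}vB` encode to the same bytes (`7d7642`) in all five charsets, which is why every row of the table ends with the same 24 bits.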

SJIS-Win, EUC-JP: Classic character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).