Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ツ湘嘉督篠ウツ偲柧ツ湘嘉督篠ウツ偲柧B 11000010100011111100001110001001110000111001001111000010100011101100001010110011110000101000111011000011100111100111011011000010100011111100001110001001110000111001001111000010100011101100001010110011110000101000111011000011100111100111011001000010 c28fc389c393c28ec2b3c28ec39e76c28fc389c393c28ec2b3c28ec39e7642
EUC-JP ツ湘嘉督篠ウツ偲柧ツ湘嘉督篠ウツ偲柧B 10001110110000101011111011000101101100101100010111000110110001001011110011000100100011101011001110001110110000101011110011000101110110111101011110001110110000101011111011000101101100101100010111000110110001001011110011000100100011101011001110001110110000101011110011000101110110111101011101000010 8ec2bec5b2c5c6c4bcc48eb38ec2bcc5dbd78ec2bec5b2c5c6c4bcc48eb38ec2bcc5dbd742
UTF-8 ツ湘嘉督篠ウツ偲柧ツ湘嘉督篠ウツ偲柧B 11101111101111101000001011100110101110011001100011100101100110001000100111100111100111011010001111100111101011111010000011101111101111011011001111101111101111101000001011100101100000011011001011100110100111111010011111101111101111101000001011100110101110011001100011100101100110001000100111100111100111011010001111100111101011111010000011101111101111011011001111101111101111101000001011100101100000011011001011100110100111111010011101000010 efbe82e6b998e59889e79da3e7afa0efbdb3efbe82e581b2e69fa7efbe82e6b998e59889e79da3e7afa0efbdb3efbe82e581b2e69fa742
UHC ?湘嘉督篠?????湘嘉督篠????B 001111111101111111001111110010101010100111010100101111011110000111000110001111110011111100111111001111110011111111011111110011111100101010101001110101001011110111100001110001100011111100111111001111110011111101000010 3fdfcfcaa9d4bde1c63f3f3f3f3fdfcfcaa9d4bde1c63f3f3f3f42

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
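
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the helper name `encode_all`. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is one plausible explanation for the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec cannot encode.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あB").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.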