To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 螯、貉ソ蟇倩セ柧螯、貉ソ蟇倩セ柧B 111001011010011010100100111001101011100110111111111001011010111110011000111010001011111010011110011101101110010110100110101001001110011010111001101111111110010110101111100110001110100010111110100111100111011001000010 e5a6a4e6b9bfe5af98e8be9e76e5a6a4e6b9bfe5af98e8be9e7642
EUC-JP 螯、貉ソ蟇倩セ柧螯、貉ソ蟇倩セ柧B 111010101010100010001110101001001110110010111011100011101011111111101010101100011101000011101010100011101011111011011011110101111110101010101000100011101010010011101100101110111000111010111111111010101011000111010000111010101000111010111110110110111101011101000010 eaa88ea4ecbb8ebfeab1d0ea8ebedbd7eaa88ea4ecbb8ebfeab1d0ea8ebedbd742
UTF-8 螯、貉ソ蟇倩セ柧螯、貉ソ蟇倩セ柧B 11101000100111101010111111101111101111011010010011101000101100101000100111101111101111011011111111101000100111111000011111100101100000001010100111101111101111011011111011100110100111111010011111101000100111101010111111101111101111011010010011101000101100101000100111101111101111011011111111101000100111111000011111100101100000001010100111101111101111011011111011100110100111111010011101000010 e89eafefbda4e8b289efbdbfe89f87e580a9efbdbee69fa7e89eafefbda4e8b289efbdbfe89f87e580a9efbdbee69fa742
UHC ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)