To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN テヲテケツ崚ァツ「ツ絞テヲテケツ崚ァツ「ツ絞B 110000111010011011000011101110011100001010011011110000111010011111000010101000101100001010001101011010011100001110100110110000111011100111000010100110111100001110100111110000101010001011000010100011010110100101000010 c3a6c3b9c29bc3a7c2a2c28d69c3a6c3b9c29bc3a7c2a2c28d6942
EUC-JP テヲテケツ崚ァツ「ツ絞テヲテケツ崚ァツ「ツ絞B 100011101100001110001110101001101000111011000011100011101011100110001110110000101101011011000101100011101010011110001110110000101000111010100010100011101100001010111001110010101000111011000011100011101010011010001110110000111000111010111001100011101100001011010110110001011000111010100111100011101100001010001110101000101000111011000010101110011100101001000010 8ec38ea68ec38eb98ec2d6c58ea78ec28ea28ec2b9ca8ec38ea68ec38eb98ec2d6c58ea78ec28ea28ec2b9ca42
UTF-8 テヲテケツ崚ァツ「ツ絞テヲテケツ崚ァツ「ツ絞B 11101111101111101000001111101111101111011010011011101111101111101000001111101111101111011011100111101111101111101000001011100101101101001001101011101111101111011010011111101111101111101000001011101111101111011010001011101111101111101000001011100111101101011001111011101111101111101000001111101111101111011010011011101111101111101000001111101111101111011011100111101111101111101000001011100101101101001001101011101111101111011010011111101111101111101000001011101111101111011010001011101111101111101000001011100111101101011001111001000010 efbe83efbda6efbe83efbdb9efbe82e5b49aefbda7efbe82efbda2efbe82e7b59eefbe83efbda6efbe83efbdb9efbe82e5b49aefbda7efbe82efbda2efbe82e7b59e42
UHC ??????????絞??????????絞B 00111111001111110011111100111111001111110011111100111111001111110011111100111111110011101110110100111111001111110011111100111111001111110011111100111111001111110011111100111111110011101110110101000010 3f3f3f3f3f3f3f3f3f3fceed3f3f3f3f3f3f3f3f3f3fceed42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)