To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}Y????????}bE 001111110011111100111111001111110011111100111111001111110011111101111101010110010011111100111111001111110011111100111111001111110011111100111111011111010110001001000101 3f3f3f3f3f3f3f3f7d593f3f3f3f3f3f3f3f7d6245
SJIS-WIN 霎」蠕枅霎」蠕杰}Y霎」蠕枅霎」蠕杰}bE 111010001011111010100011111001011011111010011110011001101110100010111110101000111110010110111110100111100101111001111101010110011110100010111110101000111110010110111110100111100110011011101000101111101010001111100101101111101001111001011110011111010110001001000101 e8bea3e5be9e66e8bea3e5be9e5e7d59e8bea3e5be9e66e8bea3e5be9e5e7d6245
EUC-JP 霎」蠕枅霎」蠕杰}Y霎」蠕枅霎」蠕杰}bE 11110000110000001000111010100011111010101100000011011011110001111111000011000000100011101010001111101010110000001101101110111111011111010101100111110000110000001000111010100011111010101100000011011011110001111111000011000000100011101010001111101010110000001101101110111111011111010110001001000101 f0c08ea3eac0dbc7f0c08ea3eac0dbbf7d59f0c08ea3eac0dbc7f0c08ea3eac0dbbf7d6245
UTF-8 霎」蠕枅霎」蠕杰}Y霎」蠕枅霎」蠕杰}bE 1110100110011100100011101110111110111101101000111110100010100000100101011110011010011110100001011110100110011100100011101110111110111101101000111110100010100000100101011110011010011101101100000111110101011001111010011001110010001110111011111011110110100011111010001010000010010101111001101001111010000101111010011001110010001110111011111011110110100011111010001010000010010101111001101001110110110000011111010110001001000101 e99c8eefbda3e8a095e69e85e99c8eefbda3e8a095e69db07d59e99c8eefbda3e8a095e69e85e99c8eefbda3e8a095e69db07d6245
UHC ???????杰}Y???????杰}bE 0011111100111111001111110011111100111111001111110011111111001011111110010111110101011001001111110011111100111111001111110011111100111111001111111100101111111001011111010110001001000101 3f3f3f3f3f3f3fcbf97d593f3f3f3f3f3f3fcbf97d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)