To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???萸ы???ォ???淫?????濡?? 0011111100111111001111111110010011001110100001001000110100111111001111110011111110000011010010000011111100111111001111111000100011111010001111110011111100111111001111110011111110010100010001110011111100111111 3f3f3fe4ce848d3f3f3f83483f3f3f88fa3f3f3f3f3f94473f3f
EUC-JP ???萸ы???ォ???淫?????濡?? 0011111100111111001111111110100011010000101001111110110100111111001111110011111110100101101010010011111100111111001111111011000011111100001111110011111100111111001111110011111111000111101010000011111100111111 3f3f3fe8d0a7ed3f3f3fa5a93f3f3fb0fc3f3f3f3f3fc7a83f3f
UTF-8 琉딀텋萸ы뜔隣삣ォ女뉙쓲淫곗뵍溜⑸죲濡쀫젽 1110111110100111100011001110101110010100100000001110110110000101100010111110100010010000101110001101000110001011111010111001110010010100111011111010011110110001111011001000001010100011111000111000001010101001111011111010011010000001111010111000100110011001111011001001001110110010111001101011011110101011111010101011001110010111111010111011010110001101111011111010011110001011111000101001000110111000111011001010001110110010111001101011111110100001111011001000000010101011111011001010000010111101 efa78ceb9480ed858be890b8d18beb9c94efa7b1ec82a3e382a9efa681eb8999ec93b2e6b7abeab397ebb58defa78be291b8eca3b2e6bfa1ec80abeca0bd
UHC 琉딀텋萸ы뜔隣삣ォ女뉙쓲淫곗뵍溜⑸죲濡쀫젽 111010111010010010001010111001101011011010001000111010111010110110101100111011011000110110010111111011001110010010111011111001011010101110101001111001011111110010000111111011011001110110010000111010111110001010110000111011001001010010010000111010101111111010101001111010111010000110001101111010111010000110010111111010111010000010101111 eba48ae6b688ebadaced8d97ece4bbe5aba9e5fc87ed9d90ebe2b0ec9490eafea9eba18deba197eba0af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)