To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 撓ο?穩?????遙??節??澳?ⅱ臆 100111011001101010000011110011010011111111100010011100100011111100111111001111110011111100111111111010101010000100111111001111111001000011011111001111110011111111100000010100110011111111111010010000011000100110110000 9d9a83cd3fe2723f3f3f3f3feaa13f3f90df3f3fe0533ffa4189b0
EUC-JP 撓ο?穩??旿??遙??節??澳??臆 11011001111110101010011011001111001111111110001111010011001111110011111110001111110000011111010000111111001111111111010010100011001111110011111111000000111000010011111100111111110111111011010000111111001111111011001010110010 d9faa6cf3fe3d33f3f8fc1f43f3ff4a33f3fc0e13f3fdfb43f3fb2b2
UTF-8 撓ο슝穩뚳슈旿댐슁遙뤹쭣節곤슝澳뺞ⅱ臆 1110011010010010100100111100111010111111111011001000101010011101111001111010100110101001111010111001101010110011111011001000101010001000111001101001011110111111111010111000110010010000111011001000101010000001111010011000000110011001111010111010010010111001111011001010110110100011111001111010111110000000111010101011001110100100111011001000101010011101111001101011111010110011111010111011101010011110111000101000010110110001111010001000011110000110 e69293cebfec8a9de7a9a9eb9ab3ec8a88e697bfeb8c90ec8a81e98199eba4b9ecada3e7af80eab3a4ec8a9de6beb3ebba9ee285b1e88786
UHC 撓ο슝穩뚳슈旿댐슁遙뤹쭣節곤슝澳뺞ⅱ臆 1110100011110101101001011110111110111101101110011110100010110001100011001110111110111101101101001110011111111010101101001110111110111101101100111110100110101011100011111110011110100111100110001110111110111101101100001110111110111101101110011110011111111110100101011110011010100101101000101110010111100110 e8f5a5efbdb9e8b18cefbdb4e7fab4efbdb3e9ab8fe7a798efbdb0efbdb9e7fe95e6a5a2e5e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)