To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 搖??節??徇?????押??慂??橈??^ 10011101100010100011111100111111100100001101111100111111001111111001110001101101001111110011111100111111001111110011111110001001100111110011111100111111100111001100100000111111001111111001111011110100001111110011111101011110 9d8a3f3f90df3f3f9c6d3f3f3f3f3f899f3f3f9cc83f3f9ef43f3f5e
EUC-JP 搖??節??徇?????押??慂??橈??^ 11011001111010100011111100111111110000001110000100111111001111111101011111001110001111110011111100111111001111110011111110110010101000010011111100111111110110001100101000111111001111111101110011110110001111110011111101011110 d9ea3f3fc0e13f3fd7ce3f3f3f3f3fb2a13f3fd8ca3f3fdcf63f3f5e
UTF-8 搖졾끁節얗숱徇쒒쐢嶺묌뼚押뤄슭慂㏂걶橈덄븳^ 11100110100100001001011011101100101000011011111011101011100000011000000111100111101011111000000011101100100101101001011111101100100010001011000111100101101111101000011111101100100100101001001011101100100100001010001011101111101001101010101111101011101011001000110011101011101111001001101011100110100010101011110011101011101001001000010011101100100010101010110111100110100001011000001011100011100011111000001011101010101100011011011011100110101010011000100011101011100011011000010011101011101110001011001101011110 e69096eca1beeb8181e7af80ec9697ec88b1e5be87ec9292ec90a2efa6abebac8cebbc9ae68abceba484ec8aade68582e38f82eab1b6e6a988eb8d84ebb8b35e
UHC 搖졾끁節얗숱徇쒒쐢嶺묌뼚押뤄슭慂㏂걶橈덄븳^ 11101000111101001010000011100101100001011011011111101111101111011011111011101001101111011010001011100010110111111001110011101001100111001000100011100111101011011001000111101001100101101010000011100100111000111011011111101111101111011011111011101001101111011010001011100011100000011001110011101000111110101000100011100111100101011001110001011110 e8f4a0e585b7efbdbee9bda2e2df9ce99c88e7ad91e996a0e4e3b7efbdbee9bda2e3819ce8fa88e7959c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)