To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竣???臍豆??精?竣???臍豆?賂??址 100011110111011000111111001111110011111111100100011000001001001110100100001111110011111110010000101110000011111110001111011101100011111100111111001111111110010001100000100100111010010000111111100110000100011100111111001111111001101010101100 8f763f3f3fe46093a43f3f90b83f8f763f3f3fe46093a43f98473f3f9aac
EUC-JP 竣???臍豆??精?竣???臍豆?賂??址 101111011101011100111111001111110011111111100111110000011100011010100110001111110011111111000000101110100011111110111101110101110011111100111111001111111110011111000001110001101010011000111111110011111010100000111111001111111101010010101110 bdd73f3f3fe7c1c6a63f3fc0ba3fbdd73f3f3fe7c1c6a63fcfa83f3fd4ae
UTF-8 竣뀜렰렋臍豆렪렧精렑竣뀜렰렋臍豆렪賂렰렑址 111001111010101110100011111010111000000010011100111010111010000010110000111010111010000010001011111010001000011110001101111010001011000110000110111010111010000010101010111010111010000010100111111001111011001010111110111010111010000010010001111001111010101110100011111010111000000010011100111010111010000010110000111010111010000010001011111010001000011110001101111010001011000110000110111010111010000010101010111010001011001110000010111010111010000010110000111010111010000010010001111001011001110110000000 e7aba3eb809ceba0b0eba08be8878de8b186eba0aaeba0a7e7b2beeba091e7aba3eb809ceba0b0eba08be8878de8b186eba0aae8b382eba0b0eba091e59d80
UHC 竣뀜렰렋臍豆렪렧精렑竣뀜렰렋臍豆렪賂렰렑址 111100011110001010110010111100011000111010111101100011101010001011110000101100001101010011100111100011101011100010001110101101101110111111110001100011101010011011110001111000101011001011110001100011101011110110001110101000101111000010110000110101001110011110001110101110001101011011110001100011101011110110001110101001101111001010100011 f1e2b2f18ebd8ea2f0b0d4e78eb88eb6eff18ea6f1e2b2f18ebd8ea2f0b0d4e78eb8d6f18ebd8ea6f2a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)