What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 歪??儼??言?????音??巍???→?^ 10011000011000110011111100111111100110010101011000111111001111111000110010111110001111110011111100111111001111110011111110001001101110010011111100111111100110111101100100111111001111110011111110000001101010000011111101011110 98633f3f99563f3f8cbe3f3f3f3f3f89b93f3f9bd93f3f3f81a83f5e
EUC-JP 歪??儼??言??濚??音??巍???→?^ 110011111100010000111111001111111101000110110111001111110011111110111000110000000011111100111111100011111100100110100001001111110011111110110010101110110011111100111111110101101101101100111111001111110011111110100010101010100011111101011110 cfc43f3fd1b73f3fb8c03f3f8fc9a13f3fb2bb3f3fd6db3f3f3fa2aa3f5e
UTF-8 歪귥뇿儼싮냽言㏓젩濚껈굚音졿뮍巍띷뜤溜→짎^ 11100110101011011010101011101010101101111010010111101011100001111011111111100101100001001011110011101100100010111010111011101011100000111011110111101000101010001000000011100011100011111001001111101100101000001010100111100110101111111001101011101010101110111000100011101010101101011001101011101001100111111011001111101100101000011011111111101011101011101000110111100101101101111000110111101011100111011011011111101011100111001010010011101111101001111000101111100010100001101001001011101100101001111000111001011110 e6adaaeab7a5eb87bfe584bcec8baeeb83bde8a880e38f93eca0a9e6bf9aeabb88eab59ae99fb3eca1bfebae8de5b78deb9db7eb9ca4efa78be28692eca78e5e
UHC 歪귥뇿儼싮냽言㏓젩濚껈굚音졿뮍巍띷뜤溜→짎^ 11101000111000001000001011101100100001111010000011100101111100001001101011101001100001101000110111100101111010111010011111101011101000001010000111100111101110011000001111101001100000101000001011101011111001011010000011100110100100101001101011101000111001001000110111100110100011011010011111101010111111101010000111100110101000111001101001011110 e8e082ec87a0e5f09ae9868de5eba7eba0a1e7b983e98282ebe5a0e6929ae8e48de68da7eafea1e6a39a5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
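
The conversion in the table above can be approximated with a short Python sketch. It is not the code behind this page: the function name `encode_row` and the codec mapping are assumptions made for illustration (Python's `cp932` stands in for SJIS-Win and `cp949` for UHC), and unencodable characters are replaced with "?" to mirror the 3f bytes in the table.

```python
# Hypothetical sketch: encode a string in several charsets and show
# the resulting bytes as binary and hexadecimal strings.

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary string, hex string) for one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


if __name__ == "__main__":
    sample = "歪^"
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

For the first and last characters of the table's input, this should reproduce the same leading and trailing bytes, for example e6adaa and 5e in the UTF-8 row.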