What bit string is a character (or several characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厭??晤??壓??濡???????????^ 1000100101111101001111110011111110011101111010110011111100111111100110101101100000111111001111111001010001000111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 897d3f3f9deb3f3f9ad83f3f94473f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP 厭??晤??壓??濡???????????^ 1011000111011110001111110011111111011010111011010011111100111111110101001101101000111111001111111100011110101000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 b1de3f3fdaed3f3fd4da3f3fc7a83f3f3f3f3f3f3f3f3f3f3f5e
UTF-8 厭얜졑晤몃젵壓믩졁濡싩뼻溜얕씤溜⒴꼺溜뀀줁^ 11100101100011101010110111101100100101101001110011101100101000011001000111100110100110011010010011101011101010101000001111101100101000001011010111100101101000111001001111101011101011111010100111101100101000011000000111100110101111111010000111101100100010111010100111101011101111001011101111101111101001111000101111101100100101101001010111101100100101001010010011101111101001111000101111100010100100101011010011101010101111001011101011101111101001111000101111101011100000001000000011101100101001001000000101011110 e58eadec969ceca191e699a4ebaa83eca0b5e5a393ebafa9eca181e6bfa1ec8ba9ebbcbbefa78bec9695ec94a4efa78be292b4eabcbaefa78beb8080eca4815e
UHC 厭얜졑晤몃젵壓믩졁濡싩뼻溜얕씤溜⒴꼺溜뀀줁^ 11100110111101001011111011101011101000001011111011100111111110111011100011101011101000001010100111100100111000101001001011101011101000001011001011101011101000011001101011100111100101101011111011101010111111101011111011101000100111011011100011101010111111101010100111100101100001001001001011101010111111101011001011101011101000011001100001011110 e6f4beeba0bee7fbb8eba0a9e4e292eba0b2eba19ae796beeafebee89db8eafea9e58492eafeb2eba1985e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
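
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own code. It assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs are close enough to the page's SJIS-WIN, EUC-JP, and UHC charsets, and that characters a charset cannot represent become `?` (0x3f), as the table rows suggest. The names `CHARSETS` and `encode_row` are hypothetical, chosen only for this illustration.

```python
# Map the table's charset labels to Python codec names.
# Assumption: these codecs approximate the charsets the page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a sample input.
sample = "厭^"
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_row(sample, codec)
    print(label, binary, hexadecimal)
```

For the input `厭`, this sketch yields `e58ead` for UTF-8, `897d` for SJIS-WIN, and `b1de` for EUC-JP, matching the leading bytes of the corresponding rows in the table, while ISO-8859-1 falls back to `3f`.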