To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??〕泣??矣????〕泣??矣??B 00111111001111111000000101101100100010111000001100111111001111111110000111100001001111110011111100111111001111111000000101101100100010111000001100111111001111111110000111100001001111110011111101000010 3f3f816c8b833f3fe1e13f3f3f3f816c8b833f3fe1e13f3f42
EUC-JP 艅?〕泣??矣??艅?〕泣??矣??B 1000111111010110111111010011111110100001110011011011010111100011001111110011111111100010111000110011111100111111100011111101011011111101001111111010000111001101101101011110001100111111001111111110001011100011001111110011111101000010 8fd6fd3fa1cdb5e33f3fe2e33f3f8fd6fd3fa1cdb5e33f3fe2e33f3f42
UTF-8 艅덈〕泣닸뿿矣몄돇艅덈〕泣닸뿿矣몄돇B 11101000100010011000010111101011100011011000100011100011100000001001010111100110101100111010001111101011100010111011100011101011101111111011111111100111100111111010001111101011101010101000010011101011100011111000011111101000100010011000010111101011100011011000100011100011100000001001010111100110101100111010001111101011100010111011100011101011101111111011111111100111100111111010001111101011101010101000010011101011100011111000011101000010 e88985eb8d88e38095e6b3a3eb8bb8ebbfbfe79fa3ebaa84eb8f87e88985eb8d88e38095e6b3a3eb8bb8ebbfbfe79fa3ebaa84eb8f8742
UHC 艅덈〕泣닸뿿矣몄돇艅덈〕泣닸뿿矣몄돇B 11100110101010011000100011101011101000011011001111101011111010001011010011100110100101111011111111101011111110001011100011101100100010011001100011100110101010011000100011101011101000011011001111101011111010001011010011100110100101111011111111101011111110001011100011101100100010011001100001000010 e6a988eba1b3ebe8b4e697bfebf8b8ec8998e6a988eba1b3ebe8b4e697bfebf8b8ec899842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)