What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????G 00111111001111110011111100111111001111110011111100111111001111110011111101000111 3f3f3f3f3f3f3f3f3f47
SJIS-WIN ??〕泣??矣??G 00111111001111111000000101101100100010111000001100111111001111111110000111100001001111110011111101000111 3f3f816c8b833f3fe1e13f3f47
EUC-JP 艅?〕泣??矣??G 100011111101011011111101001111111010000111001101101101011110001100111111001111111110001011100011001111110011111101000111 8fd6fd3fa1cdb5e33f3fe2e33f3f47
UTF-8 艅덈〕泣닸뿿矣몄돇G 11101000100010011000010111101011100011011000100011100011100000001001010111100110101100111010001111101011100010111011100011101011101111111011111111100111100111111010001111101011101010101000010011101011100011111000011101000111 e88985eb8d88e38095e6b3a3eb8bb8ebbfbfe79fa3ebaa84eb8f8747
UHC 艅덈〕泣닸뿿矣몄돇G 11100110101010011000100011101011101000011011001111101011111010001011010011100110100101111011111111101011111110001011100011101100100010011001100001000111 e6a988eba1b3ebe8b4e697bfebf8b8ec899847

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
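
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the `?` entries in the table above.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset: (binary string, hex string)} for the given text."""
    rows = {}
    for name, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[name] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("泣G").items():
        print(name, binary, hexadecimal)
```

Results for characters outside a charset's core repertoire may differ slightly between implementations, so the output of this sketch is not guaranteed to match the page byte for byte.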