What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??〕泣??怨レ???〕泣??怨ル?^ 001111110011111110000001011011001000101110000011001111110011111110001001100001011000001110001100001111110011111100111111100000010110110010001011100000110011111100111111100010011000010110000011100010110011111101011110 3f3f816c8b833f3f8985838c3f3f3f816c8b833f3f8985838b3f5e
EUC-JP 艅?〕泣?ł怨レ?艅?〕泣?ł怨ル?^ 1000111111010110111111010011111110100001110011011011010111100011001111111000111110101001110010001011000111100101101001011110110000111111100011111101011011111101001111111010000111001101101101011110001100111111100011111010100111001000101100011110010110100101111010110011111101011110 8fd6fd3fa1cdb5e33f8fa9c8b1e5a5ec3f8fd6fd3fa1cdb5e33f8fa9c8b1e5a5eb3f5e
UTF-8 艅덈〕泣당ł怨レ삒艅덈〕泣당ł怨ル성^ 1110100010001001100001011110101110001101100010001110001110000000100101011110011010110011101000111110101110001011101110011100010110000010111001101000000010101000111000111000001110101100111011001000001010010010111010001000100110000101111010111000110110001000111000111000000010010101111001101011001110100011111010111000101110111001110001011000001011100110100000001010100011100011100000111010101111101100100001001011000101011110 e88985eb8d88e38095e6b3a3eb8bb9c582e680a8e383acec8292e88985eb8d88e38095e6b3a3eb8bb9c582e680a8e383abec84b15e
UHC 艅덈〕泣당ł怨レ삒艅덈〕泣당ł怨ル성^ 11100110101010011000100011101011101000011011001111101011111010001011010011100111101010011010100111101010101100111010101111101100100110001001011111100110101010011000100011101011101000011011001111101011111010001011010011100111101010011010100111101010101100111010101111101011101111001011101001011110 e6a988eba1b3ebe8b4e7a9a9eab3abec9897e6a988eba1b3ebe8b4e7a9a9eab3abebbcba5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
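
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` is hypothetical, and the Python codec names `cp932` and `cp949` are assumed to be close stand-ins for SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table above, though individual mappings may differ from this tool's.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 approximates SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 approximates UHC
}


def encode_report(text):
    """Return {charset label: (bit string, hex string)} for the given text."""
    report = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        report[label] = (bits, data.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexa) in encode_report("^").items():
        print(label, bits, hexa)
```

For the input "^", every row prints 01011110 and 5e, the same trailing byte that appears at the end of each row in the table above.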