To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???泣??由?????夭??B 00111111001111110011111110001011100000110011111100111111100101110101001000111111001111110011111100111111001111111001101011101110001111110011111101000010 3f3f3f8b833f3f97523f3f3f3f3f9aee3f3f42
EUC-JP ???泣??由??洧??夭??B 001111110011111100111111101101011110001100111111001111111100110110110011001111110011111110001111110001111011010000111111001111111101010011110000001111110011111101000010 3f3f3fb5e33f3fcdb33f3f8fc7b43f3fd4f03f3f42
UTF-8 麗몃쓹泣곲렟由앮갭洧뺤뒃夭곕쬆B 11101111101001101000100011101011101010101000001111101100100100111011100111100110101100111010001111101010101100111011001011101011101000001001111111100111100101001011000111101100100101011010111011101010101100001010110111100110101101001010011111101011101110101010010011101011100100101000001111100101101001001010110111101010101100111001010111101100101011001000011001000010 efa688ebaa83ec93b9e6b3a3eab3b2eba09fe794b1ec95aeeab0ade6b4a7ebbaa4eb9283e5a4adeab395ecac8642
UHC 麗몃쓹泣곲렟由앮갭洧뺤뒃夭곕쬆B 11100110101100001011100011101011100111011001010111101011111010001000000111101001100011101011000011101011101001101001110111100110101100001011100011101010111110111001010111101100100010101000000111101000111011001011000011101011101001101001110101000010 e6b0b8eb9d95ebe881e98eb0eba69de6b0b8eafb95ec8a81e8ecb0eba69d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)