To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 溜삳젷溜삳ℓ溜뺣졎溜뱕溜삳젷溜삳ℓ溜뺣졎溜뱕B 11101111101001111000101111101100100000101011001111101100101000001011011111101111101001111000101111101100100000101011001111100010100001001001001111101111101001111000101111101011101110101010001111101100101000011000111011101111101001111000101111101011101100011001010111101111101001111000101111101100100000101011001111101100101000001011011111101111101001111000101111101100100000101011001111100010100001001001001111101111101001111000101111101011101110101010001111101100101000011000111011101111101001111000101111101011101100011001010101000010 efa78bec82b3eca0b7efa78bec82b3e28493efa78bebbaa3eca18eefa78bebb195efa78bec82b3eca0b7efa78bec82b3e28493efa78bebbaa3eca18eefa78bebb19542
UHC 溜삳젷溜삳ℓ溜뺣졎溜뱕溜삳젷溜삳ℓ溜뺣졎溜뱕B 111010101111111010111011111010111010000010101011111010101111111010111011111010111010011110100100111010101111111010010101111010111010000010111011111010101111111010010011011101101110101011111110101110111110101110100000101010111110101011111110101110111110101110100111101001001110101011111110100101011110101110100000101110111110101011111110100100110111011001000010 eafebbeba0abeafebbeba7a4eafe95eba0bbeafe9376eafebbeba0abeafebbeba7a4eafe95eba0bbeafe937642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)