To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ??┝簡??┝稈^ 00111111001111111000010010111010100010101100100000111111001111111000010010111010111000100110001001011110 3f3f84ba8ac83f3f84bae2625e
EUC-JP ??┝簡??┝稈^ 00111111001111111010100010111100101101001100101000111111001111111010100010111100111000111100001101011110 3f3fa8bcb4ca3f3fa8bce3c35e
UTF-8 셈뤚┝簡셈뤚┝稈^ 11101100100001011000100011101011101001001001101011100010100101001001110111100111101100001010000111101100100001011000100011101011101001001001101011100010100101001001110111100111101010001000100001011110 ec8588eba49ae2949de7b0a1ec8588eba49ae2949de7a8885e
UHC 셈뤚┝簡셈뤚┝稈^ 1011110011000000100011111100100110100110101111001100101011011011101111001100000010001111110010011010011010111100110010101101100101011110 bcc08fc9a6bccadbbcc08fc9a6bccad95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)