To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 雅??坂??宋??B 10001001111010110011111100111111100011011110001000111111001111111001000101110110001111110011111101000010 89eb3f3f8de23f3f91763f3f42
EUC-JP 雅??坂??宋??B 10110010111011010011111100111111101110101110010000111111001111111100000111010111001111110011111101000010 b2ed3f3fbae43f3fc1d73f3f42
UTF-8 雅ⓥ넀坂싦벰宋먾슊B 11101001100110111000010111100010100100111010010111101011100001001000000011100101100111011000001011101100100010111010011011101011101100101011000011100101101011101000101111101011101010001011111011101100100010101000101001000010 e99b85e293a5eb8480e59d82ec8ba6ebb2b0e5ae8beba8beec8a8a42
UHC 雅ⓥ넀坂싦벰宋먾슊B 11100100101110101010100011100010100001101001000011110111111110001001101011100100101110101010100011100001111001001001000011111000100110101001101001000010 e4baa8e28690f7f89ae4baa8e1e490f89a9a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)