Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 臧??箋?臧??箋?B 111001000110100000111111001111111110001010110011001111111110010001101000001111110011111111100010101100110011111101000010 e4683f3fe2b33fe4683f3fe2b33f42
EUC-JP 臧??箋?臧??箋?B 111001111100100100111111001111111110010010110101001111111110011111001001001111110011111111100100101101010011111101000010 e7c93f3fe4b53fe7c93f3fe4b53f42
UTF-8 臧섉쐩箋뱕臧섉쐩箋뱕B 11101000100001111010011111101100100001001000100111101100100100001010100111100111101011101000101111101011101100011001010111101000100001111010011111101100100001001000100111101100100100001010100111100111101011101000101111101011101100011001010101000010 e887a7ec8489ec90a9e7ae8bebb195e887a7ec8489ec90a9e7ae8bebb19542
UHC 臧섉쐩箋뱕臧섉쐩箋뱕B 111011011111010110011000111001101001110010001110111011111010100010010011011101101110110111110101100110001110011010011100100011101110111110101000100100110111011001000010 edf598e69c8eefa89376edf598e69c8eefa8937642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is replaced with "?" (0x3f) in the output.
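
The conversion in the table can be approximated with a minimal Python sketch. This is not the page's own implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced with "?" to mirror the 3f bytes seen above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("B").items():
        print(label, bits, hex_string)
```

For the input "B", every charset in the mapping yields the single byte 42 (binary 01000010), which matches the last byte of each row in the table.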