What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | 臾щぜ臾щぜB | 11100100011010111000010010001011100000101011101011100100011010111000010010001011100000101011101001000010 | e46b848b82bae46b848b82ba42 |
| EUC-JP | 臾щぜ臾щぜB | 11100111110011001010011111101011101001001011110011100111110011001010011111101011101001001011110001000010 | e7cca7eba4bce7cca7eba4bc42 |
| UTF-8 | 臾щぜ臾щぜB | 1110100010000111101111101101000110001001111000111000000110011100111010001000011110111110110100011000100111100011100000011001110001000010 | e887bed189e3819ce887bed189e3819c42 |
| UHC | 臾щぜ臾щぜB | 11101011101011001010110011101011101010101011110011101011101011001010110011101011101010101011110001000010 | ebacacebaabcebacacebaabc42 |
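
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC, and the helper name `to_bit_strings` is hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which mirrors the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Python equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the Python equivalent
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("ぜB", codec)
        print(label, binary, hexadecimal)
```

The trailing `B` encodes to the single byte 0x42 (`01000010`) in every charset listed, which is why each row of the table ends in `42`.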

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).