To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??袁j?齬??猷??惟??曖 0011111100111111001111111000100110000111001111110011111111100101110011011000001010001010001111111110101010010111001111110011111110010111010100010011111100111111100010001101001000111111001111111001111001000010 3f3f3f89873f3fe5cd828a3fea973f3f97513f3f88d23f3f9e42
EUC-JP 艅??援??袁j?齬??猷??惟??曖 10001111110101101111110100111111001111111011000111100111001111110011111111101010110011111010001111101010001111111111001111110111001111110011111111001101101100100011111100111111101100001101010000111111001111111101101110100011 8fd6fd3f3fb1e73f3feacfa3ea3ff3f73f3fcdb23f3fb0d43f3fdba3
UTF-8 艅덈낄援앮껸袁j뻗齬잆굥猷됪첀惟깅윥曖 111010001000100110000101111010111000110110001000111010111000001010000100111001101000111110110100111011001001010110101110111010101011101110111000111010001010001010000001111011111011110110001010111010111011101110010111111010011011110110101100111011001001111010000110111010101011010110100101111001111000110010110111111010111001000010101010111011001011001010000000111001101000001110011111111010101011100110000101111011001001110010100101111001101001101110010110 e88985eb8d88eb8284e68fb4ec95aeeabbb8e8a281efbd8aebbb97e9bdacec9e86eab5a5e78cb7eb90aaecb280e6839feab985ec9ca5e69b96
UHC 艅덈낄援앮껸袁j뻗齬잆굥猷됪첀惟깅윥曖 1110011010101001100010001110101110110011101001011110101010110101100111011110011010110010101110011110101010111110101000111110101010111011101110001110010111100001100111111110001110000010100010111110101110100011100010011110011010101010100011011110101011101110101100011110101110011111101001011110010011110010 e6a988ebb3a5eab59de6b2b9eabea3eabbb8e5e19fe3828beba389e6aa8deaeeb1eb9fa5e4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)