To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??魚???雍?┗辛 0011111100111111100010111001101100111111001111110011111111101000101101000011111110000100101011111001000001101000 3f3f8b9b3f3f3fe8b43f84af9068
EUC-JP 澐?魚???雍?┗辛 10001111110010001110100100111111101101011111101100111111001111110011111111110000101101100011111110101000101100011011111111001001 8fc8e93fb5fb3f3f3ff0b63fa8b1bfc9
UTF-8 澐렏魚편렒렒雍멨┗辛 111001101011111010010000111010111010000010001111111010011010110110011010111011011000111010111000111010111010000010010010111010111010000010010010111010011001101110001101111010111010100110101000111000101001010010010111111010001011111010011011 e6be90eba08fe9ad9aed8eb8eba092eba092e99b8deba9a8e29497e8be9b
UHC 澐렏魚편렒렒雍멨┗辛 1110100111111010100011101010010111100101111000001100011011101101100011101010011110001110101001111110100010111100101110001110010110100110101100011110001111110100 e9fa8ea5e5e0c6ed8ea78ea7e8bcb8e5a6b1e3f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)