To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??猿??艶j?苡??誘る?曖 11100001100111110011111100111111100101110101000100111111001111111000100110001110001111110011111110001001100100001000001010001010001111111110010010001111001111110011111110010111010101011000001011101001001111111001111001000010 e19f3f3f97513f3f898e3f3f8990828a3fe48f3f3f975582e93f9e42
EUC-JP 癲??猷??猿??艶j?苡??誘る?曖 11100010101000010011111100111111110011011011001000111111001111111011000111101110001111110011111110110001111100001010001111101010001111111110011111101111001111110011111111001101101101101010010011101011001111111101101110100011 e2a13f3fcdb23f3fb1ee3f3fb1f0a3ea3fe7ef3f3fcdb6a4eb3fdba3
UTF-8 癲숆낄猷쀦궇猿낅겱艶j퍔苡멨짃誘る윥曖 111001111001100110110010111011001000100010000110111010111000001010000100111001111000110010110111111011001000000010100110111010101011011010000111111001111000110010111111111010111000001010000101111010101011001010110001111010001000100110110110111011111011110110001010111011011000110110010100111010001000101110100001111010111010100110101000111011001010011110000011111010001010101010011000111000111000001010001011111011001001110010100101111001101001101110010110 e799b2ec8886eb8284e78cb7ec80a6eab687e78cbfeb8285eab2b1e889b6efbd8aed8d94e88ba1eba9a8eca783e8aa98e3828bec9ca5e69b96
UHC 癲숆낄猷쀦궇猿낅겱艶j퍔苡멨짃誘る윥曖 1110111110100110100110011110101010110011101001011110101110100011100101111110011010000010101000001110101010111011100001011110101110000001101111011110011011111101101000111110101010111011100010111110110010111110101110001110010110100011100100111110101110101111101010101110101110011111101001011110010011110010 efa699eab3a5eba397e682a0eabb85eb81bde6fda3eabb8becbeb8e5a393ebafaaeb9fa5e4f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)