To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??猿??艶j?苡??巡??孃 111000011001111100111111001111111001011101010001001111110011111110001001100011100011111100111111100010011001000010000010100010100011111111100100100011110011111100111111100011111000010000111111001111111001101101101111 e19f3f3f97513f3f898e3f3f8990828a3fe48f3f3f8f843f3f9b6f
EUC-JP 癲??猷??猿??艶j?苡??巡??孃 111000101010000100111111001111111100110110110010001111110011111110110001111011100011111100111111101100011111000010100011111010100011111111100111111011110011111100111111101111011110010000111111001111111101010111010000 e2a13f3fcdb23f3fb1ee3f3fb1f0a3ea3fe7ef3f3fbde43f3fd5d0
UTF-8 癲숆낄猷쀦궇猿낅겱艶j퍔苡며뮫巡볥걙孃 111001111001100110110010111011001000100010000110111010111000001010000100111001111000110010110111111011001000000010100110111010101011011010000111111001111000110010111111111010111000001010000101111010101011001010110001111010001000100110110110111011111011110110001010111011011000110110010100111010001000101110100001111010111010100110110000111010111010111010101011111001011011011110100001111010111011001110100101111010101011000110011001111001011010110110000011 e799b2ec8886eb8284e78cb7ec80a6eab687e78cbfeb8285eab2b1e889b6efbd8aed8d94e88ba1eba9b0ebaeabe5b7a1ebb3a5eab199e5ad83
UHC 癲숆낄猷쀦궇猿낅겱艶j퍔苡며뮫巡볥걙孃 1110111110100110100110011110101010110011101001011110101110100011100101111110011010000010101000001110101010111011100001011110101110000001101111011110011011111101101000111110101010111011100010111110110010111110101110001110011110010010101101011110001011011110100100111110101110000001100000111110010110111110 efa699eab3a5eba397e682a0eabb85eb81bde6fda3eabb8becbeb8e792b5e2de93eb8183e5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)