To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ???五??嚴??甸????五??嚴??甸?E 0011111100111111001111111000110011011100001111110011111110011010100011100011111100111111100110011011001000111111001111110011111100111111100011001101110000111111001111111001101010001110001111110011111110011001101100100011111101000101 3f3f3f8cdc3f3f9a8e3f3f99b23f3f3f3f8cdc3f3f9a8e3f3f99b23f45
EUC-JP ???五??嚴??甸????五??嚴??甸?E 0011111100111111001111111011100011011110001111110011111111010011111011100011111100111111110100101011010000111111001111110011111100111111101110001101111000111111001111111101001111101110001111110011111111010010101101000011111101000101 3f3f3fb8de3f3fd3ee3f3fd2b43f3f3f3fb8de3f3fd3ee3f3fd2b43f45
UTF-8 聯앸젗五묓렚嚴딃땶甸멮聯앸젗五묓렚嚴딃땶甸멳E 11101111101001101001011111101100100101011011100011101100101000001001011111100100101110101001010011101011101011001001001111101011101000001001101011100101100110101011010011101011100101001000001111101011100101011011011011100111100101001011100011101011101010011010111011101111101001101001011111101100100101011011100011101100101000001001011111100100101110101001010011101011101011001001001111101011101000001001101011100101100110101011010011101011100101001000001111101011100101011011011011100111100101001011100011101011101010011011001101000101 efa697ec95b8eca097e4ba94ebac93eba09ae59ab4eb9483eb95b6e794b8eba9aeefa697ec95b8eca097e4ba94ebac93eba09ae59ab4eb9483eb95b6e794b8eba9b345
UHC 聯앸젗五묓렚嚴딃땶甸멮聯앸젗五묓렚嚴딃땶甸멳E 111001101110000110011101111010111010000010010011111001111110100110010001111011011000111010101101111001011111000110001010111010011000101110001100111011111010010010010001010110011110011011100001100111011110101110100000100100111110011111101001100100011110110110001110101011011110010111110001100010101110100110001011100011001110111110100100100100010110001001000101 e6e19deba093e7e991ed8eade5f18ae98b8cefa49159e6e19deba093e7e991ed8eade5f18ae98b8cefa4916245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)