To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??誼??怨???λ?應??蹂??汚???? 111000011001111100111111001111111000101101100010001111110011111110001001100001010011111100111111001111111000001111001001001111111001110011100100001111110011111111100110111110000011111100111111100010011001100000111111001111110011111100111111 e19f3f3f8b623f3f89853f3f3f83c93f9ce43f3fe6f83f3f89983f3f3f3f
EUC-JP 癲??誼??怨???λ?應??蹂??汚???? 111000101010000100111111001111111011010111000011001111110011111110110001111001010011111100111111001111111010011011001011001111111101100011100110001111110011111111101100111110100011111100111111101100011111100000111111001111110011111100111111 e2a13f3fb5c33f3fb1e53f3f3fa6cb3fd8e63f3fecfa3f3fb1f83f3f3f3f
UTF-8 癲용씭誼쀦레怨쀬솈列λ돁應싷쭓蹂좎툗汚살슜履잹 1110011110011001101100101110110010011010101010011110110010010100101011011110100010101010101111001110110010000000101001101110101110100000100010001110011010000000101010001110110010000000101011001110110010000110100010001110111110100110100111001100111010111011111010111000111110000001111001101000011110001001111011001000101110110111111011001010110110010011111010001011100110000010111011001010001010001110111011011000100010010111111001101011000110011010111011001000001010110100111011001000101010011100111011111010011110011111111011001001111010111001 e799b2ec9aa9ec94ade8aabcec80a6eba088e680a8ec80acec8688efa69ccebbeb8f81e68789ec8bb7ecad93e8b982eca28eed8897e6b19aec82b4ec8a9cefa79fec9eb9
UHC 癲용씭誼쀦레怨쀬솈列λ돁應싷쭓蹂좎툗汚살슜履잹 11101111101001101011111111101011100111011011111011101011111111101001011111100110101101111011100111101010101100111001011111101100100110011000110011100110111010101010010111101011100010011001010011101011111010111001101011101111101001111000101111101011101100111010000011101100101110001000111011100111111111011011101111101100100110101010100111101100101010101010000001000010 efa6bfeb9dbeebfe97e6b7b9eab397ec998ce6eaa5eb8994ebeb9aefa78bebb3a0ecb88ee7fdbbec9aa9ecaaa042

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
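
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own code: the mapping from the charset labels to Python codec names (for example `cp932` for SJIS-Win and `cp949` for UHC) is an assumption, and `to_bit_strings` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what produces the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (byte 0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, padded with leading zeros.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample Japanese character.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

For example, "あ" encodes to e38182 in UTF-8, 82a0 in CP932, and a4a2 in EUC-JP, while ISO-8859-1 cannot represent it and yields 3f.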