Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8倚??乙μ?馭??日??巡??椰??? 11100001100111110011111110000010010101111001100011011111001111110011111110001001101100111000001111001010001111111110100101100110001111110011111110010011111110100011111100111111100011111000010000111111001111111001111010111101001111110011111100111111 e19f3f825798df3f3f89b383ca3fe9663f3f93fa3f3f8f843f3f9ebd3f3f3f
EUC-JP 癲?8倚??乙μ?馭??日??巡??椰??? 11100010101000010011111110100011101110001101000011100001001111110011111110110010101101011010011011001100001111111111000111000111001111110011111111000110111111000011111100111111101111011110010000111111001111111101110010111111001111110011111100111111 e2a13fa3b8d0e13f3fb2b5a6cc3ff1c73f3fc6fc3f3fbde43f3fdcbf3f3f3f
UTF-8 癲쒕8倚싷쭎乙μ뵯馭귙꺈日뗧춯巡볥엔椰꾩뼇柳 1110011110011001101100101110110010010010100101011110111110111100100110001110010110000000100110101110110010001011101101111110110010101101100011101110010010111001100110011100111010111100111010111011010110101111111010011010011010101101111010101011011110011001111010101011101010001000111001101001011110100101111010111001011110100111111011001011011010101111111001011011011110100001111010111011001110100101111011001001011110010100111001101010010010110000111010101011111010101001111010111011110010000111111011111010011110001001 e799b2ec9295efbc98e5809aec8bb7ecad8ee4b999cebcebb5afe9a6adeab799eaba88e697a5eb97a7ecb6afe5b7a1ebb3a5ec9794e6a4b0eabea9ebbc87efa789
UHC 癲쒕8倚싷쭎乙μ뵯馭귙꺈日뗧춯巡볥엔椰꾩뼇柳 1110111110100110100111001110101110100011101110001110101111101111100110101110111110100111100001111110101111100000101001011110110010010100101011011110010111011111100000101110001110000011101011111110110011101101100010111110011110101101100011001110001011011110100100111110101110111111101000111110010110101011100001001110110010010110100100011110101011110111 efa69ceba3b8ebef9aefa787ebe0a5ec94ade5df82e383afeced8be7ad8ce2de93ebbfa3e5ab84ec9691eaf7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
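
The conversion in the table can be reproduced offline. The following is a minimal sketch in Python, not this page's actual implementation. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) is an assumption, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears consistent with the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in convert("日").items():
        print(name, bits, hexa)
```

Calling convert on a single character such as 日 gives one row per charset, in the same column order as the table: binary bit string first, then hexadecimal.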