To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇??違??幽??渦?????魏??躍??唯 1110101010000111001111110011111110001000111000010011111100111111100101110100100000111111001111111000100101010001001111110011111100111111001111110011111111101001101100000011111100111111100101101111010000111111001111111001011101000010 ea873f3f88e13f3f97483f3f89513f3f3f3f3fe9b03f3f96f43f3f9742
EUC-JP 鼇??違??幽??渦??靷??魏??躍??唯 11110011111001110011111100111111101100001110001100111111001111111100110110101001001111110011111110110001101100100011111100111111100011111110011110111101001111110011111111110010101100100011111100111111110011001111011000111111001111111100110110100011 f3e73f3fb0e33f3fcda93f3fb1b23f3f8fe7bd3f3ff2b23f3fccf63f3fcda3
UTF-8 鼇앸뵂違긺독幽뗫닟渦긱깶靷볟섧魏됲뫒躍년뫔唯 111010011011110010000111111011001001010110111000111010111011010110000010111010011000000110010101111010101011100010111010111010111000111110000101111001011011100110111101111010111001011110101011111010111000101110011111111001101011100010100110111010101011100010110001111010101011100110110110111010011001110110110111111010111011001110011111111011001000010010100111111010011010110110001111111010111001000010110010111010111010101110010010111010001011101010001101111010111000010110000100111010111010101110010100111001011001010010101111 e9bc87ec95b8ebb582e98195eab8baeb8f85e5b9bdeb97abeb8b9fe6b8a6eab8b1eab9b6e99db7ebb39fec84a7e9ad8feb90b2ebab92e8ba8deb8584ebab94e594af
UHC 鼇앸뵂違긺독幽뗫닟渦긱깶靷볟섧魏됲뫒躍년뫔唯 1110100010101000100111011110101110010100100010001110101011011110101100011110011110110101101101101110101011101011100010111110101110001000100111111110100010111110101100011110001110000011101001001110110011100110100100111110010110111100101101011110101011100000100010011110110110010001101101001110010110111000101100111110001010010001101101101110101011100110 e8a89deb9488eadeb1e7b5b6eaeb8beb889fe8beb1e383a4ece693e5bcb5eae089ed91b4e5b8b3e291b6eae6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)