To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鼇??俺??狎??}鼇??俺??狎??{^ 111010101000011100111111001111111000100110110100001111110011111111100000101111100011111100111111011111011110101010000111001111110011111110001001101101000011111100111111111000001011111000111111001111110111101101011110 ea873f3f89b43f3fe0be3f3f7dea873f3f89b43f3fe0be3f3f7b5e
EUC-JP 鼇??俺??狎??}鼇??俺??狎??{^ 111100111110011100111111001111111011001010110110001111110011111111100000110000000011111100111111011111011111001111100111001111110011111110110010101101100011111100111111111000001100000000111111001111110111101101011110 f3e73f3fb2b63f3fe0c03f3f7df3e73f3fb2b63f3fe0c03f3f7b5e
UTF-8 鼇룟ㄽ俺븃퍘狎쀦쪛}鼇룟ㄽ俺븃퍘狎쀦쪛{^ 111010011011110010000111111010111010001110011111111000111000010010111101111001001011111110111010111010111011100010000011111011011000110110011000111001111000101110001110111011001000000010100110111011001010101010011011011111011110100110111100100001111110101110100011100111111110001110000100101111011110010010111111101110101110101110111000100000111110110110001101100110001110011110001011100011101110110010000000101001101110110010101010100110110111101101011110 e9bc87eba39fe384bde4bfbaebb883ed8d98e78b8eec80a6ecaa9b7de9bc87eba39fe384bde4bfbaebb883ed8d98e78b8eec80a6ecaa9b7b5e
UHC 鼇룟ㄽ俺븃퍘狎쀦쪛}鼇룟ㄽ俺븃퍘狎쀦쪛{^ 111010001010100010110111111001011010010010101101111001011110111110111010111010001011101110001111111001001110010010010111111001101010010110010100011111011110100010101000101101111110010110100100101011011110010111101111101110101110100010111011100011111110010011100100100101111110011010100101100101000111101101011110 e8a8b7e5a4ade5efbae8bb8fe4e497e6a5947de8a8b7e5a4ade5efbae8bb8fe4e497e6a5947b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)