To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥????????循??厄β?維?ぜ儒?? 11100000110011100011111100111111001111110011111100111111001111110011111100111111100011110111101000111111001111111001011011101111100000111100000000111111100010001101101100111111100000101011101010001110111100100011111100111111 e0ce3f3f3f3f3f3f3f3f8f7a3f3f96ef83c03f88db3f82ba8ef23f3f
EUC-JP 猥??堉?????循??厄β?維?ぜ儒?? 111000001101000000111111001111111000111110110111111111010011111100111111001111110011111100111111101111011101101100111111001111111100110011110001101001101100001000111111101100001101110100111111101001001011110010111100111101000011111100111111 e0d03f3f8fb7fd3f3f3f3f3fbddb3f3fccf1a6c23fb0dd3fa4bcbcf43f3f
UTF-8 猥롢뀧堉쏄여琉껆춱循뗪퍥厄β뼰維뽬ぜ儒븍춴 1110011110001100101001011110101110100001101000101110101110000000101001111110010110100000100010011110110010001111100001001110110010010111101011001110111110100111100011001110101010111011100001101110110010110110101100011110010110111110101010101110101110010111101010101110110110001101101001011110010110001110100001001100111010110010111010111011110010110000111001111011011010101101111010111011110110101100111000111000000110011100111001011000010010010010111010111011100010001101111011001011011010110100 e78ca5eba1a2eb80a7e5a089ec8f84ec97acefa78ceabb86ecb6b1e5beaaeb97aaed8da5e58e84ceb2ebbcb0e7b6adebbdace3819ce58492ebb88decb6b4
UHC 猥롢뀧堉쏄여琉껆춱循뗪퍥厄β뼰維뽬ぜ儒븍춴 111010001110010110001110111000111000010110011110111010111011110010011011111010101011111110101001111010111010010010000011111001111010110110001101111000101110000010001011111010101011101110011100111001001111100010100101111000101001011010110011111010111010101110010110111010001010101010111100111010101110001110111010111010111010110110010000 e8e58ee3859eebbc9beabfa9eba483e7ad8de2e08beabb9ce4f8a5e296b3ebab96e8aabceae3baebad90

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)