What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥????????循??亦??逾↓.魏?? 11100000110011100011111100111111001111110011111100111111001111110011111100111111100011110111101000111111001111111001011010010010001111110011111111100111101001011000000110101011100000010100010011101001101100000011111100111111 e0ce3f3f3f3f3f3f3f3f8f7a3f3f96923f3fe7a581ab8144e9b03f3f
EUC-JP 猥??堉?????循??亦??逾↓.魏?? 111000001101000000111111001111111000111110110111111111010011111100111111001111110011111100111111101111011101101100111111001111111100101111110010001111110011111111101110101001111010001010101101101000011010010111110010101100100011111100111111 e0d03f3f8fb7fd3f3f3f3f3fbddb3f3fcbf23f3feea7a2ada1a5f2b23f3f
UTF-8 猥롢뀧堉쏄여琉껆춱循뗪퍥亦껋꼦逾↓.魏녿굵 111001111000110010100101111010111010000110100010111010111000000010100111111001011010000010001001111011001000111110000100111011001001011110101100111011111010011110001100111010101011101110000110111011001011011010110001111001011011111010101010111010111001011110101010111011011000110110100101111001001011101010100110111010101011101110001011111010101011110010100110111010011000000010111110111000101000011010010011111011111011110010001110111010011010110110001111111010111000010110111111111010101011010110110101 e78ca5eba1a2eb80a7e5a089ec8f84ec97acefa78ceabb86ecb6b1e5beaaeb97aaed8da5e4baa6eabb8beabca6e980bee28693efbc8ee9ad8feb85bfeab5b5
UHC 猥롢뀧堉쏄여琉껆춱循뗪퍥亦껋꼦逾↓.魏녿굵 111010001110010110001110111000111000010110011110111010111011110010011011111010101011111110101001111010111010010010000011111001111010110110001101111000101110000010001011111010101011101110011100111001101011001010000011111011001000010010000011111010111011010110100001111010011010001110101110111010101110000010000110111010111011000110111101 e8e58ee3859eebbc9beabfa9eba483e7ad8de2e08beabb9ce6b283ec8483ebb5a1e9a3aeeae086ebb1bd

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table.
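
The same kind of conversion can be sketched offline in Python. This is only an illustration, not the code behind this page: the function name encode_table is hypothetical, and the Python codec names (cp932 standing in for SJIS-Win, cp949 for UHC) are assumed equivalents that may differ from this converter in a few edge-case mappings. Unencodable characters are replaced with "?" (3f), which matches what the table shows.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary bit string and as hexadecimal.
# Codec names are Python's; treating cp932 as SJIS-Win and cp949 as UHC
# is an assumption made for this example.

CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent, instead of raising an exception.
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for charset, bits, hexa in encode_table("あ"):
        print(charset, bits, hexa)
```

Each byte is formatted as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.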