To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??誼??臾??猥???筌?????臾??? 1001100011011010001111110011111110001011011000100011111100111111111001000110101100111111001111111110000011001110001111110011111100111111111000101010001100111111001111110011111100111111001111111110010001101011001111110011111100111111 98da3f3f8b623f3fe46b3f3fe0ce3f3f3fe2a33f3f3f3f3fe46b3f3f3f
EUC-JP 俑??誼??臾??猥???筌??彛??臾??? 11010000110111000011111100111111101101011100001100111111001111111110011111001100001111110011111111100000110100000011111100111111001111111110010010100101001111110011111110001111101111001111101000111111001111111110011111001100001111110011111100111111 d0dc3f3fb5c33f3fe7cc3f3fe0d03f3f3fe4a53f3f8fbcfa3f3fe7cc3f3f3f
UTF-8 俑앹뼔誼뜹렚臾먯젔猥롈쎌벣筌덈벝彛묉벉臾먯젩僚 111001001011111110010001111011001001010110111001111010111011110010010100111010001010101010111100111010111001110010111001111010111010000010011010111010001000011110111110111010111010100010101111111011001010000010010100111001111000110010100101111010111010000110001000111011001000111010001100111010111011001010100011111001111010110110001100111010111000110110001000111010111011001010011101111001011011110110011011111010111010110010001001111010111011001010001001111010001000011110111110111010111010100010101111111011001010000010101001111011111010011010111011 e4bf91ec95b9ebbc94e8aabceb9cb9eba09ae887beeba8afeca094e78ca5eba188ec8e8cebb2a3e7ad8ceb8d88ebb29de5bd9bebac89ebb289e887beeba8afeca0a9efa6bb
UHC 俑앹뼔誼뜹렚臾먯젔猥롈쎌벣筌덈벝彛묉벉臾먯젩僚 11101001101101011001110111101100100101101001110011101011111111101011011011100101100011101010110111101011101011001001000011101100101000001001001011101000111001011000111011001110101111011110110010010011101111001110111110100111100010001110101110010011101110001110110010101101100100011110011010010011101011001110101110101100100100001110110010100000101000011110100011101000 e9b59dec969cebfeb6e58eadebac90eca092e8e58ecebdec93bcefa788eb93b8ecad91e693acebac90eca0a1e8e8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)