To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥??肄ゅ?依???⑥?夜???θ?揄?? 1110010111110001001111110011111111100011111001011000001011100011001111111000100011001011001111110011111100111111100001110100010100111111100101101110100100111111001111110011111110000011110001100011111110011101100010010011111100111111 e5f13f3fe3e582e33f88cb3f3f3f87453f96e93f3f3f83c63f9d893f3f
EUC-JP 褥?ŀ肄ゅ?依?????夜???θ?揄?? 111010101111001100111111100011111010100111001001111001101110011110100100111001010011111110110000110011010011111100111111001111110011111100111111110011001110101100111111001111110011111110100110110010000011111111011001111010010011111100111111 eaf33f8fa9c9e6e7a4e53fb0cd3f3f3f3f3fcceb3f3f3fa6c83fd9e93f3f
UTF-8 褥띕ŀ肄ゅ쮦依띺낭硫⑥쉠夜껋뮁杻θ삃揄용쑖 11101000101001001010010111101011100111011001010111000101100000001110100010000010100001001110001110000010100001011110110010101110101001101110010010111110100111011110101110011101101110101110101110000010101011011110111110100111100011101110001010010001101001011110110010001001101000001110010110100100100111001110101010111011100010111110101110101110100000011110111110100111100010001100111010111000111011001000001010000011111001101000111110000100111011001001101010101001111011001001000110010110 e8a4a5eb9d95c580e88284e38285ecaea6e4be9deb9dbaeb82adefa78ee291a5ec89a0e5a49ceabb8bebae81efa788ceb8ec8283e68f84ec9aa9ec9196
UHC 褥띕ŀ肄ゅ쮦依띺낭硫⑥쉠夜껋뮁杻θ삃揄용쑖 111010011011001110110110111010111010100110101000111011001011110110101010111001011010100010000011111010111110111010001101111010011011001110110110111010111010100110101000111011001011110110101010111001011010100010000011111011001001001010010000111010101111010010100101111010001001100010001010111010101111000110111111111010111001110010110101 e9b3b6eba9a8ecbdaae5a883ebee8de9b3b6eba9a8ecbdaae5a883ec9290eaf4a5e8988aeaf1bfeb9cb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)