To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?暴??忌げ緬え??暴??忌げ緬え?B 0011111110010110010111000011111100111111100010101111010110000010101100001001011011001001100000101010011000111111001111111001011001011100001111110011111110001010111101011000001010110000100101101100100110000010101001100011111101000010 3f965c3f3f8af582b096c982a63f3f965c3f3f8af582b096c982a63f42
EUC-JP ?暴??忌げ緬え??暴??忌げ緬え?B 0011111111001011101111010011111100111111101101001111011110100100101100101100110011001011101001001010100000111111001111111100101110111101001111110011111110110100111101111010010010110010110011001100101110100100101010000011111101000010 3fcbbd3f3fb4f7a4b2cccba4a83f3fcbbd3f3fb4f7a4b2cccba4a83f42
UTF-8 뤋暴쭗샘忌げ緬え및뤋暴쭗샘忌げ緬え및B 11101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011100101101111111000110011100011100000011001001011100111101101111010110011100011100000011000100011101011101100001000111111101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011100101101111111000110011100011100000011001001011100111101101111010110011100011100000011000100011101011101100001000111101000010 eba48be69ab4ecad97ec8398e5bf8ce38192e7b7ace38188ebb08feba48be69ab4ecad97ec8398e5bf8ce38192e7b7ace38188ebb08f42
UHC 뤋暴쭗샘忌げ緬え및뤋暴쭗샘忌げ緬え및B 10001111101110111111100011101100101001111000111110111011111110011101000011111011101010101011001011011000111110111010101010101000101110011101011110001111101110111111100011101100101001111000111110111011111110011101000011111011101010101011001011011000111110111010101010101000101110011101011101000010 8fbbf8eca78fbbf9d0fbaab2d8fbaaa8b9d78fbbf8eca78fbbf9d0fbaab2d8fbaaa8b9d742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)