To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?脹????澣荊?荊?枇???暴??懶ぇ邯え?^ 00111111100100101010111100111111001111110011111100111111111000000101010010001100011101000011111110001100011101000011111110010100111110000011111100111111001111111001011001011100001111110011111110011100111011111000001010100101111001111011011010000010101001100011111101011110 3f92af3f3f3f3fe0548c743f8c743f94f83f3f3f965c3f3f9cef82a5e7b682a63f5e
EUC-JP ?脹????澣荊?荊?枇???暴??懶ぇ邯え?^ 00111111110001001011000100111111001111110011111100111111110111111011010110110111110101010011111110110111110101010011111111001000111110100011111100111111001111111100101110111101001111110011111111011000111100011010010010100111111011101011100010100100101010000011111101011110 3fc4b13f3f3f3fdfb5b7d53fb7d53fc8fa3f3f3fcbbd3f3fd8f1a4a7eeb8a4a83f5e
UTF-8 뤋脹쭗샘렑뤋澣荊㎁荊㉤枇샅렒뤋暴쫸샘懶ぇ邯え뒬^ 11101011101001001000101111101000100001001011100111101100101011011001011111101100100000111001100011101011101000001001000111101011101001001000101111100110101111101010001111101000100011011000101011100011100011101000000111101000100011011000101011100011100010011010010011100110100111101000011111101100100000111000010111101011101000001001001011101011101001001000101111100110100110101011010011101100101010111011100011101100100000111001100011100110100001111011011011100011100000011000011111101001100000101010111111100011100000011000100011101011100100101010110001011110 eba48be884b9ecad97ec8398eba091eba48be6bea3e88d8ae38e81e88d8ae389a4e69e87ec8385eba092eba48be69ab4ecabb8ec8398e687b6e38187e982afe38188eb92ac5e
UHC 뤋脹쭗샘렑뤋澣荊㎁荊㉤枇샅렒뤋暴쫸샘懶ぇ邯え뒬^ 1000111110111011111100111110110010100111100011111011101111111001100011101010011010001111101110111111100111010100111110111010101010100111110010101111101110101010101010001011010111011101111011011011101111110100100011101010011110001111101110111111100011101100101001101000111110111011111110011101010011111011101010101010011111001010111110111010101010101000101101011101110001011110 8fbbf3eca78fbbf98ea68fbbf9d4fbaaa7cafbaaa8b5ddedbbf48ea78fbbf8eca68fbbf9d4fbaaa7cafbaaa8b5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)