What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弔????屯???豆?魄弔????屯???豆?白^ 100100101010001000111111001111110011111100111111100100111101010000111111001111110011111110010011101001000011111111101001101011101001001010100010001111110011111100111111001111111001001111010100001111110011111100111111100100111010010000111111100101001001001001011110 92a23f3f3f3f93d43f3f3f93a43fe9ae92a23f3f3f3f93d43f3f3f93a43f94925e
EUC-JP 弔?勖??屯??祜豆?魄弔?勖??屯??祜豆?白^ 1100010010100100001111111000111110110011111011010011111100111111110001101101011000111111001111111000111111010000110110001100011010100110001111111111001010110000110001001010010000111111100011111011001111101101001111110011111111000110110101100011111100111111100011111101000011011000110001101010011000111111110001111111001001011110 c4a43f8fb3ed3f3fc6d63f3f8fd0d8c6a63ff2b0c4a43f8fb3ed3f3fc6d63f3f8fd0d8c6a63fc7f25e
UTF-8 弔렲勖쾅렎屯렱렲祜豆렠魄弔렲勖쾅렎屯렱렲祜豆렠白^ 11100101101111001001010011101011101000001011001011100101100010111001011011101100101111101000010111101011101000001000111011100101101100011010111111101011101000001011000111101011101000001011001011100111101001011001110011101000101100011000011011101011101000001010000011101001101011011000010011100101101111001001010011101011101000001011001011100101100010111001011011101100101111101000010111101011101000001000111011100101101100011010111111101011101000001011000111101011101000001011001011100111101001011001110011101000101100011000011011101011101000001010000011100111100110011011110101011110 e5bc94eba0b2e58b96ecbe85eba08ee5b1afeba0b1eba0b2e7a59ce8b186eba0a0e9ad84e5bc94eba0b2e58b96ecbe85eba08ee5b1afeba0b1eba0b2e7a59ce8b186eba0a0e799bd5e
UHC 弔렲勖쾅렎屯렱렲祜豆렠魄弔렲勖쾅렎屯렱렲祜豆렠白^ 11110000110000001000111010111111111010011110110111000100111001111000111010100100110101001110101010001110101111101000111010111111111110111101010011010100111001111000111010110001110110111101111011110000110000001000111010111111111010011110110111000100111001111000111010100100110101001110101010001110101111101000111010111111111110111101010011010100111001111000111010110001110110111101110001011110 f0c08ebfe9edc4e78ea4d4ea8ebe8ebffbd4d4e78eb1dbdef0c08ebfe9edc4e78ea4d4ea8ebe8ebffbd4d4e78eb1dbdc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
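
A conversion like the one in the table above can be approximated offline. The following is a minimal sketch in Python, not this page's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper name `encode_bits` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table; results for edge-case characters may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "白^"  # last two characters of the sample input in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Using `errors="replace"` rather than the default strict mode keeps the output aligned character by character across charsets, at the cost of hiding which characters were actually lost.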