To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??議???ル〕日??臾??繹?ぉ 11100001100111110011111110000010010101111110011011111010001111110011111110001011011000110011111100111111001111111000001110001011100000010110110010010011111110100011111100111111111001000110101100111111001111111110001110001000001111111000001010100111 e19f3f8257e6fa3f3f8b633f3f3f838b816c93fa3f3fe46b3f3fe3883f82a7
EUC-JP 癲?8踰??議???ル〕日??臾??繹?ぉ 11100010101000010011111110100011101110001110110011111100001111110011111110110101110001000011111100111111001111111010010111101011101000011100110111000110111111000011111100111111111001111100110000111111001111111110010111101000001111111010010010101001 e2a13fa3b8ecfc3f3fb5c43f3f3fa5eba1cdc6fc3f3fe7cc3f3fe5e83fa4a9
UTF-8 癲쒕8踰앯럤議욧콟曆ル〕日딃뼮臾롪뻗繹먯ぉ 111001111001100110110010111011001001001010010101111011111011110010011000111010001011100010110000111011001001010110101111111010111001111110100100111010001010110110110000111011001001101010100111111011001011110110011111111011111010011010001011111000111000001110101011111000111000000010010101111001101001011110100101111010111001010010000011111010111011110010101110111010001000011110111110111010111010000110101010111010111011101110010111111001111011100110111001111010111010100010101111111000111000000110001001 e799b2ec9295efbc98e8b8b0ec95afeb9fa4e8adb0ec9aa7ecbd9fefa68be383abe38095e697a5eb9483ebbcaee887beeba1aaebbb97e7b9b9eba8afe38189
UHC 癲쒕8踰앯럤議욧콟曆ル〕日딃뼮臾롪뻗繹먯ぉ 111011111010011010011100111010111010001110111000111010111011001010011101111001111000111010000111111011001010000110111111111010101011000110010111111001101011011110101011111010111010000110110011111011001110110110001010111010011001011010110001111010111010110010001110111010101011101110111000111001101011101010010000111011001010101010101001 efa69ceba3b8ebb29de78e87eca1bfeab197e6b7abeba1b3eced8ae996b1ebac8eeabbb8e6ba90ecaaa9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)