To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???znf???zn^}Y???znf???zn^}bE 0011111100111111001111110111101001101110011001100011111100111111001111110111101001101110010111100111110101011001001111110011111100111111011110100110111001100110001111110011111100111111011110100110111001011110011111010110001001000101 3f3f3f7a6e663f3f3f7a6e5e7d593f3f3f7a6e663f3f3f7a6e5e7d6245
SJIS-WIN 北鏥悶znf北鏥悶zn^}Y北鏥悶znf北鏥悶zn^}bE 1001011001101011111010000101010010010110111000110111101001101110011001101001011001101011111010000101010010010110111000110111101001101110010111100111110101011001100101100110101111101000010101001001011011100011011110100110111001100110100101100110101111101000010101001001011011100011011110100110111001011110011111010110001001000101 966be85496e37a6e66966be85496e37a6e5e7d59966be85496e37a6e66966be85496e37a6e5e7d6245
EUC-JP 北鏥悶znf北鏥悶zn^}Y北鏥悶znf北鏥悶zn^}bE 1100101111001100111011111011010111001100111001010111101001101110011001101100101111001100111011111011010111001100111001010111101001101110010111100111110101011001110010111100110011101111101101011100110011100101011110100110111001100110110010111100110011101111101101011100110011100101011110100110111001011110011111010110001001000101 cbccefb5cce57a6e66cbccefb5cce57a6e5e7d59cbccefb5cce57a6e66cbccefb5cce57a6e5e7d6245
UTF-8 北鏥悶znf北鏥悶zn^}Y北鏥悶znf北鏥悶zn^}bE 1110010110001100100101111110100110001111101001011110011010000010101101100111101001101110011001101110010110001100100101111110100110001111101001011110011010000010101101100111101001101110010111100111110101011001111001011000110010010111111010011000111110100101111001101000001010110110011110100110111001100110111001011000110010010111111010011000111110100101111001101000001010110110011110100110111001011110011111010110001001000101 e58c97e98fa5e682b67a6e66e58c97e98fa5e682b67a6e5e7d59e58c97e98fa5e682b67a6e66e58c97e98fa5e682b67a6e5e7d6245
UHC 北?悶znf北?悶zn^}Y北?悶znf北?悶zn^}bE 11011101110000010011111111011010101111110111101001101110011001101101110111000001001111111101101010111111011110100110111001011110011111010101100111011101110000010011111111011010101111110111101001101110011001101101110111000001001111111101101010111111011110100110111001011110011111010110001001000101 ddc13fdabf7a6e66ddc13fdabf7a6e5e7d59ddc13fdabf7a6e66ddc13fdabf7a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)