To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?謔荊?荊?荊?去?謔荊?荊?荊?醵^ 0011111111100110100000101000110001110100001111111000110001110100001111111000110001110100001111111000101110001110001111111110011010000010100011000111010000111111100011000111010000111111100011000111010000111111111001111101000101011110 3fe6828c743f8c743f8c743f8b8e3fe6828c743f8c743f8c743fe7d15e
EUC-JP ?謔荊?荊?荊?去?謔荊?荊?荊?醵^ 0011111111101011111000101011011111010101001111111011011111010101001111111011011111010101001111111011010111101110001111111110101111100010101101111101010100111111101101111101010100111111101101111101010100111111111011101101001101011110 3febe2b7d53fb7d53fb7d53fb5ee3febe2b7d53fb7d53fb7d53feed35e
UTF-8 뤋謔荊귑荊꾜荊㈈去뤋謔荊귑荊꾜荊㈈醵^ 11101011101001001000101111101000101011001001010011101000100011011000101011101010101101111001000111101000100011011000101011101010101111101001110011101000100011011000101011100011100010001000100011100101100011101011101111101011101001001000101111101000101011001001010011101000100011011000101011101010101101111001000111101000100011011000101011101010101111101001110011101000100011011000101011100011100010001000100011101001100001101011010101011110 eba48be8ac94e88d8aeab791e88d8aeabe9ce88d8ae38888e58ebbeba48be8ac94e88d8aeab791e88d8aeabe9ce88d8ae38888e986b55e
UHC 뤋謔荊귑荊꾜荊㈈去뤋謔荊귑荊꾜荊㈈醵^ 10001111101110111111100111001100111110111010101010110001110100101111101110101010101100101101100011111011101010101010100110111001110010111101101110001111101110111111100111001100111110111010101010110001110100101111101110101010101100101101100011111011101010101010100110111001110010111101100101011110 8fbbf9ccfbaab1d2fbaab2d8fbaaa9b9cbdb8fbbf9ccfbaab1d2fbaab2d8fbaaa9b9cbd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)