To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃?????幽??筌??諭ε?諛??億?? 10010111100000000011111100111111001111110011111100111111100101110100100000111111001111111110001010100011001111110011111110010111010000001000001111000011001111111110011010000111001111110011111110001001101011010011111100111111 97803f3f3f3f3f97483f3fe2a33f3f974083c33fe6873f3f89ad3f3f
EUC-JP 沃??堉??幽??筌??諭ε?諛??億?? 110011011110000000111111001111111000111110110111111111010011111100111111110011011010100100111111001111111110010010100101001111110011111111001101101000011010011011000101001111111110101111100111001111110011111110110010101011110011111100111111 cde03f3f8fb7fd3f3fcda93f3fe4a53f3fcda1a6c53febe73f3fb2af3f3f
UTF-8 沃욌쪇堉싩뙴幽껊겱筌먯빖諭ε넇諛몃룇億됱걖 1110011010110010100000111110110010011010100011001110110010101010100001111110010110100000100010011110110010001011101010011110101110011001101101001110010110111001101111011110101010111011100010101110101010110010101100011110011110101101100011001110101110101000101011111110101110111001100101101110100010101011101011011100111010110101111010111000010010000111111010001010101110011011111010111010101010000011111010111010001110000111111001011000010010000100111010111001000010110001111010101011000110010110 e6b283ec9a8cecaa87e5a089ec8ba9eb99b4e5b9bdeabb8aeab2b1e7ad8ceba8afebb996e8abadceb5eb8487e8ab9bebaa83eba387e58484eb90b1eab196
UHC 沃욌쪇堉싩뙴幽껊겱筌먯빖諭ε넇諛몃룇億됱걖 111010001010101010011110111010111010010110000001111010111011110010011010111001111000110010110111111010101110101110000011111010111000000110111101111011111010011110010000111011001001010110111000111010111011000110100101111001011000011010010111111010111011000010111000111010111000111110000110111001011110001010001001111011001000000110000001 e8aa9eeba581ebbc9ae78cb7eaeb83eb81bdefa790ec95b8ebb1a5e58697ebb0b8eb8f86e5e289ec8181

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)