To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 淨?畯??弔?儀??v淨?畯??弔?儀??vB 10011111110001000011111111111011011011110011111100111111100100101010001000111111100010110101011000111111001111110111011010011111110001000011111111111011011011110011111100111111100100101010001000111111100010110101011000111111001111110111011001000010 9fc43ffb6f3f3f92a23f8b563f3f769fc43ffb6f3f3f92a23f8b563f3f7642
EUC-JP 淨?畯??弔?儀??v淨?畯??弔?儀??vB 110111101100011000111111100011111100110110111011001111110011111111000100101001000011111110110101101101110011111100111111011101101101111011000110001111111000111111001101101110110011111100111111110001001010010000111111101101011011011100111111001111110111011001000010 dec63f8fcdbb3f3fc4a43fb5b73f3f76dec63f8fcdbb3f3fc4a43fb5b73f3f7642
UTF-8 淨렠畯흔긺弔렲儀븀볕v淨렠畯흔긺弔렲儀븀볕vB 111001101011011110101000111010111010000010100000111001111001010110101111111011011001110110010100111010101011100010111010111001011011110010010100111010111010000010110010111001011000010010000000111010111011100010000000111010111011001110010101011101101110011010110111101010001110101110100000101000001110011110010101101011111110110110011101100101001110101010111000101110101110010110111100100101001110101110100000101100101110010110000100100000001110101110111000100000001110101110110011100101010111011001000010 e6b7a8eba0a0e795afed9d94eab8bae5bc94eba0b2e58480ebb880ebb39576e6b7a8eba0a0e795afed9d94eab8bae5bc94eba0b2e58480ebb880ebb3957642
UHC 淨렠畯흔긺弔렲儀븀볕v淨렠畯흔긺弔렲儀븀볕vB 11101111111001001000111010110001111100011110000111001000111001111011000111100111111100001100000010001110101111111110101111110000101110101110011110111010101101010111011011101111111001001000111010110001111100011110000111001000111001111011000111100111111100001100000010001110101111111110101111110000101110101110011110111010101101010111011001000010 efe48eb1f1e1c8e7b1e7f0c08ebfebf0bae7bab576efe48eb1f1e1c8e7b1e7f0c08ebfebf0bae7bab57642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
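
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (hex 3f), which imitates the 3f bytes visible in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hexstr in encode_table("vB"):
    print(label, bits, hexstr)
```

For the ASCII input "vB", every charset listed here yields the same two bytes (hex 7642), which matches the trailing bytes of each row in the table. Differences between charsets only appear for non-ASCII input.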