To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 淨?寃??弔????大v淨?寃??弔????大vB 100111111100010000111111100110111000001100111111001111111001001010100010001111110011111100111111001111111001000111100101011101101001111111000100001111111001101110000011001111110011111110010010101000100011111100111111001111110011111110010001111001010111011001000010 9fc43f9b833f3f92a23f3f3f3f91e5769fc43f9b833f3f92a23f3f3f3f91e57642
EUC-JP 淨?寃??弔?勖??大v淨?寃??弔?勖??大vB 11011110110001100011111111010101111000110011111100111111110001001010010000111111100011111011001111101101001111110011111111000010111001110111011011011110110001100011111111010101111000110011111100111111110001001010010000111111100011111011001111101101001111110011111111000010111001110111011001000010 dec63fd5e33f3fc4a43f8fb3ed3f3fc2e776dec63fd5e33f3fc4a43f8fb3ed3f3fc2e77642
UTF-8 淨렠寃닺백弔렲勖쾅렎大v淨렠寃닺백弔렲勖쾅렎大vB 111001101011011110101000111010111010000010100000111001011010111110000011111010111000101110111010111010111011000010110001111001011011110010010100111010111010000010110010111001011000101110010110111011001011111010000101111010111010000010001110111001011010010010100111011101101110011010110111101010001110101110100000101000001110010110101111100000111110101110001011101110101110101110110000101100011110010110111100100101001110101110100000101100101110010110001011100101101110110010111110100001011110101110100000100011101110010110100100101001110111011001000010 e6b7a8eba0a0e5af83eb8bbaebb0b1e5bc94eba0b2e58b96ecbe85eba08ee5a4a776e6b7a8eba0a0e5af83eb8bbaebb0b1e5bc94eba0b2e58b96ecbe85eba08ee5a4a77642
UHC 淨렠寃닺백弔렲勖쾅렎大v淨렠寃닺백弔렲勖쾅렎大vB 1110111111100100100011101011000111101010101100101011010011101000101110011110100111110000110000001000111010111111111010011110110111000100111001111000111010100100110100111101111001110110111011111110010010001110101100011110101010110010101101001110100010111001111010011111000011000000100011101011111111101001111011011100010011100111100011101010010011010011110111100111011001000010 efe48eb1eab2b4e8b9e9f0c08ebfe9edc4e78ea4d3de76efe48eb1eab2b4e8b9e9f0c08ebfe9edc4e78ea4d3de7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
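
A rough sketch of how a table like the one above could be reproduced in Python. This is not the converter's own code: the names CODECS, encode_row and encode_table are hypothetical, and the mapping from the charset labels to Python codec names (cp932 for SJIS-Win, cp949 for UHC) is an assumption. Replacing unencodable characters with "?" (byte 3f) is inferred from the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_table("大vB").items():
        print(name, bits, hexstr)
```

For the input "大vB", the UTF-8 entry comes out as e5a4a77642, which matches the trailing bytes of the UTF-8 row in the table.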