To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 悟??????ょ?}v悟??????ょ?}vB 100011001110010100111111001111110011111100111111001111110011111110000010111001010011111101111101011101101000110011100101001111110011111100111111001111110011111100111111100000101110010100111111011111010111011001000010 8ce53f3f3f3f3f3f82e53f7d768ce53f3f3f3f3f3f82e53f7d7642
EUC-JP 悟??渶???ょ?}v悟??渶???ょ?}vB 10111000111001110011111100111111100011111100011111101101001111110011111100111111101001001110011100111111011111010111011010111000111001110011111100111111100011111100011111101101001111110011111100111111101001001110011100111111011111010111011001000010 b8e73f3f8fc7ed3f3f3fa4e73f7d76b8e73f3f8fc7ed3f3f3fa4e73f7d7642
UTF-8 悟싨쪛渶싩퉽獵ょ쉿}v悟싨쪛渶싩퉽獵ょ쉿}vB 1110011010000010100111111110110010001011101010001110110010101010100110111110011010111000101101101110110010001011101010011110110110001001101111011110111110100110101001111110001110000010100001111110110010001001101111110111110101110110111001101000001010011111111011001000101110101000111011001010101010011011111001101011100010110110111011001000101110101001111011011000100110111101111011111010011010100111111000111000001010000111111011001000100110111111011111010111011001000010 e6829fec8ba8ecaa9be6b8b6ec8ba9ed89bdefa6a7e38287ec89bf7d76e6829fec8ba8ecaa9be6b8b6ec8ba9ed89bdefa6a7e38287ec89bf7d7642
UHC 悟싨쪛渶싩퉽獵ょ쉿}v悟싨쪛渶싩퉽獵ょ쉿}vB 1110011111110110100110101110011010100101100101001110011110110111100110101110011110111001100101011110011110100110101010101110011110111101101100100111110101110110111001111111011010011010111001101010010110010100111001111011011110011010111001111011100110010101111001111010011010101010111001111011110110110010011111010111011001000010 e7f69ae6a594e7b79ae7b995e7a6aae7bdb27d76e7f69ae6a594e7b79ae7b995e7a6aae7bdb27d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)