What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 巡?世淵?誠淵?誠}巡?世淵?誠淵?誠{^ 100011111000010000111111100100001010001010010101101000110011111110010000101111011001010110100011001111111001000010111101011111011000111110000100001111111001000010100010100101011010001100111111100100001011110110010101101000110011111110010000101111010111101101011110 8f843f90a295a33f90bd95a33f90bd7d8f843f90a295a33f90bd95a33f90bd7b5e
EUC-JP 巡?世淵?誠淵?誠}巡?世淵?誠淵?誠{^ 101111011110010000111111110000001010010011001010101001010011111111000000101111111100101010100101001111111100000010111111011111011011110111100100001111111100000010100100110010101010010100111111110000001011111111001010101001010011111111000000101111110111101101011110 bde43fc0a4caa53fc0bfcaa53fc0bf7dbde43fc0a4caa53fc0bfcaa53fc0bf7b5e
UTF-8 巡섑世淵걄誠淵걄誠}巡섑世淵걄誠淵걄誠{^ 111001011011011110100001111011001000010010010001111001001011100010010110111001101011011110110101111010101011000110000100111010001010101010100000111001101011011110110101111010101011000110000100111010001010101010100000011111011110010110110111101000011110110010000100100100011110010010111000100101101110011010110111101101011110101010110001100001001110100010101010101000001110011010110111101101011110101010110001100001001110100010101010101000000111101101011110 e5b7a1ec8491e4b896e6b7b5eab184e8aaa0e6b7b5eab184e8aaa07de5b7a1ec8491e4b896e6b7b5eab184e8aaa0e6b7b5eab184e8aaa07b5e
UHC 巡섑世淵걄誠淵걄誠}巡섑世淵걄誠淵걄誠{^ 111000101101111010011000111011011110000110100110111001101101000010000001011011001110000110100100111001101101000010000001011011001110000110100100011111011110001011011110100110001110110111100001101001101110011011010000100000010110110011100001101001001110011011010000100000010110110011100001101001000111101101011110 e2de98ede1a6e6d0816ce1a4e6d0816ce1a47de2de98ede1a6e6d0816ce1a4e6d0816ce1a47b5e

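SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively. Characters that a charset cannot represent appear as "?" (hex 3f) in the table above.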
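
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The Python codec names below (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed equivalents of the table's charset labels, and the helper names encode_row and encode_table are hypothetical. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("世").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.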