To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 役??蟻??蟻??[役??蟻??蟻??[^ 100101101111000000111111001111111000101101100001001111110011111110001011011000010011111100111111010110111001011011110000001111110011111110001011011000010011111100111111100010110110000100111111001111110101101101011110 96f03f3f8b613f3f8b613f3f5b96f03f3f8b613f3f8b613f3f5b5e
EUC-JP 役??蟻??蟻??[役??蟻??蟻??[^ 110011001111001000111111001111111011010111000010001111110011111110110101110000100011111100111111010110111100110011110010001111110011111110110101110000100011111100111111101101011100001000111111001111110101101101011110 ccf23f3fb5c23f3fb5c23f3f5bccf23f3fb5c23f3fb5c23f3f5b5e
UTF-8 役당쿊蟻쒖옖蟻귨쫷[役당쿊蟻쒖옖蟻귨쫷[^ 111001011011110110111001111010111000101110111001111011001011111110001010111010001001111110111011111011001001001010010110111011001001100010010110111010001001111110111011111010101011011110101000111011001010101110110111010110111110010110111101101110011110101110001011101110011110110010111111100010101110100010011111101110111110110010010010100101101110110010011000100101101110100010011111101110111110101010110111101010001110110010101011101101110101101101011110 e5bdb9eb8bb9ecbf8ae89fbbec9296ec9896e89fbbeab7a8ecabb75be5bdb9eb8bb9ecbf8ae89fbbec9296ec9896e89fbbeab7a8ecabb75b5e
UHC 役당쿊蟻쒖옖蟻귨쫷[役당쿊蟻쒖옖蟻귨쫷[^ 111001101011010110110100111001111011001010011111111010111111110010011100111011001001111010011100111010111111110010000010111011111010011010001110010110111110011010110101101101001110011110110010100111111110101111111100100111001110110010011110100111001110101111111100100000101110111110100110100011100101101101011110 e6b5b4e7b29febfc9cec9e9cebfc82efa68e5be6b5b4e7b29febfc9cec9e9cebfc82efa68e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)