To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????管悠??鷹?????泣ゆ?管悠?? 001111110011111100111111001111110011111100111111100010101100011110010111010010010011111100111111100100011110100100111111001111110011111100111111001111111000101110000011100000101110010000111111100010101100011110010111010010010011111100111111 3f3f3f3f3f3f8ac797493f3f91e93f3f3f3f3f8b8382e43f8ac797493f3f
EUC-JP ???佾??管悠??鷹?????泣ゆ?管悠?? 0011111100111111001111111000111110110000111110110011111100111111101101001100100111001101101010100011111100111111110000101110101100111111001111110011111100111111001111111011010111100011101001001110011000111111101101001100100111001101101010100011111100111111 3f3f3f8fb0fb3f3fb4c9cdaa3f3fc2eb3f3f3f3f3fb5e3a4e63fb4c9cdaa3f3f
UTF-8 麗몃쓷佾쒏룚管悠끾뉩鷹꾨븕捻뀀뿫泣ゆ룚管悠끾쾮 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111011001001001010001111111010111010001110011010111001111010111010100001111001101000001010100000111010111000000110111110111010111000100110101001111010011011011110111001111010101011111010101000111010111011100010010101111011111010011010100100111010111000000010000000111010111011111110101011111001101011001110100011111000111000001010000110111010111010001110011010111001111010111010100001111001101000001010100000111010111000000110111110111011001011111010101110 efa688ebaa83ec93b7e4bdbeec928feba39ae7aea1e682a0eb81beeb89a9e9b7b9eabea8ebb895efa6a4eb8080ebbfabe6b3a3e38286eba39ae7aea1e682a0eb81beecbeae
UHC 麗몃쓷佾쒏룚管悠끾뉩鷹꾨븕捻뀀뿫泣ゆ룚管悠끾쾮 11100110101100001011100011101011100111011001010011101100111010111001110011100110100011111001011011001110101101111110101011101101100001011110011010110100101110011110101111101101100001001110101110010101100000011110011011110111101100101110101110010111101010111110101111101000101010101110011010001111100101101100111010110111111010101110110110000101111001101011001010000101 e6b0b8eb9d94eceb9ce68f96ceb7eaed85e6b4b9ebed84eb9581e6f7b2eb97abebe8aae68f96ceb7eaed85e6b285

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)