To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 円??甕?????[円??甕?????[^ 10001001011111100011111100111111111000010101000000111111001111110011111100111111001111110101101110001001011111100011111100111111111000010101000000111111001111110011111100111111001111110101101101011110 897e3f3fe1503f3f3f3f3f5b897e3f3fe1503f3f3f3f3f5b5e
EUC-JP 円??甕?????[円??甕?????[^ 10110001110111110011111100111111111000011011000100111111001111110011111100111111001111110101101110110001110111110011111100111111111000011011000100111111001111110011111100111111001111110101101101011110 b1df3f3fe1b13f3f3f3f3f5bb1df3f3fe1b13f3f3f3f3f5b5e
UTF-8 円띨뜐甕닸쇂欌볢릿[円띨뜐甕닸쇂欌볢릿[^ 111001011000011010000110111010111001110110101000111010111001110010010000111001111001010010010101111010111000101110111000111011001000011110000010111001101010110010001100111010111011001110100010111010111010011010111111010110111110010110000110100001101110101110011101101010001110101110011100100100001110011110010100100101011110101110001011101110001110110010000111100000101110011010101100100011001110101110110011101000101110101110100110101111110101101101011110 e58686eb9da8eb9c90e79495eb8bb8ec8782e6ac8cebb3a2eba6bf5be58686eb9da8eb9c90e79495eb8bb8ec8782e6ac8cebb3a2eba6bf5b5e
UHC 円띨뜐甕닸쇂欌볢릿[円띨뜐甕닸쇂欌볢릿[^ 111001011111011110110110111011101000110110010011111010001011100010110100111001101001100110110110111011011110101110010011111010001011100010110100010110111110010111110111101101101110111010001101100100111110100010111000101101001110011010011001101101101110110111101011100100111110100010111000101101000101101101011110 e5f7b6ee8d93e8b8b4e699b6edeb93e8b8b45be5f7b6ee8d93e8b8b4e699b6edeb93e8b8b45b5e

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
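
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the function name encode_table and the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions. Characters a charset cannot represent are replaced with "?", which is consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (charset label, binary string, hex string) for text."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("円[^"):
        print(label, bits, hex_string)
```

For the single character 円 this gives e58686 in UTF-8, 897e in SJIS-WIN, b1df in EUC-JP, and 3f ("?") in ISO-8859-1, matching the leading bytes of the corresponding rows in the table.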