To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 俉??節??譽??鼇??也??昻??要??B 1111101001100001001111110011111110010000110111110011111100111111111001101010001100111111001111111110101010000111001111110011111110010110111001110011111100111111111110101101000000111111001111111001011101110110001111110011111101000010 fa613f3f90df3f3fe6a33f3fea873f3f96e73f3ffad03f3f97763f3f42
EUC-JP 俉??節??譽??鼇??也?????要??B 1000111110110001101110110011111100111111110000001110000100111111001111111110110010100101001111110011111111110011111001110011111100111111110011001110100100111111001111110011111100111111001111111100110111010111001111110011111101000010 8fb1bb3f3fc0e13f3feca53f3ff3e73f3fcce93f3f3f3f3fcdd73f3f42
UTF-8 俉녑쪍節얍떉譽낂찣鼇덅뜵也뉛슭昻잒꼨要잒쮫B 11100100101111111000100111101011100001011001000111101100101010101000110111100111101011111000000011101100100101101000110111101011100101101000100111101000101011011011110111101011100000101000001011101100101100001010001111101001101111001000011111101011100011011000010111101011100111001011010111100100101110011001111111101011100010011001101111101100100010101010110111100110100110001011101111101100100111101001001011101010101111001010100011101000101001101000000111101100100111101001001011101100101011101010101101000010 e4bf89eb8591ecaa8de7af80ec968deb9689e8adbdeb8282ecb0a3e9bc87eb8d85eb9cb5e4b99feb899bec8aade698bbec9e92eabca8e8a681ec9e92ecaeab42
UHC 俉녑쪍節얍떉譽낂찣鼇덅뜵也뉛슭昻잒꼨要잒쮫B 11100111111010111011001111100101101001011000011111101111101111011011111011100101100010111001111111100111111000101000010111101001101010011001111111101000101010001000100011101000100011011011001111100101101001011000011111101111101111011011111011100100111010011001111111101000100001001000010111101001101010011001111111101000101010001000100001000010 e7ebb3e5a587efbdbee58b9fe7e285e9a99fe8a888e88db3e5a587efbdbee4e99fe88485e9a99fe8a88842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)