To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 矮℡?鴨??昻??n}矮℡?鴨??昻??n{^ 11100001111000101000011110000100001111111000101010011011001111110011111111111010110100000011111100111111011011100111110111100001111000101000011110000100001111111000101010011011001111110011111111111010110100000011111100111111011011100111101101011110 e1e287843f8a9b3f3ffad03f3f6e7de1e287843f8a9b3f3ffad03f3f6e7b5e
EUC-JP 矮??鴨?????n}矮??鴨?????n{^ 111000101110010000111111001111111011001111111011001111110011111100111111001111110011111101101110011111011110001011100100001111110011111110110011111110110011111100111111001111110011111100111111011011100111101101011110 e2e43f3fb3fb3f3f3f3f3f6e7de2e43f3fb3fb3f3f3f3f3f6e7b5e
UTF-8 矮℡떉鴨뚪깴昻잌뇤n}矮℡떉鴨뚪깴昻잌뇤n{^ 1110011110011111101011101110001010000100101000011110101110010110100010011110100110110100101010001110101110011010101010101110101010111001101101001110011010011000101110111110110010011110100011001110101110000111101001000110111001111101111001111001111110101110111000101000010010100001111010111001011010001001111010011011010010101000111010111001101010101010111010101011100110110100111001101001100010111011111011001001111010001100111010111000011110100100011011100111101101011110 e79faee284a1eb9689e9b4a8eb9aaaeab9b4e698bbec9e8ceb87a46e7de79faee284a1eb9689e9b4a8eb9aaaeab9b4e698bbec9e8ceb87a46e7b5e
UHC 矮℡떉鴨뚪깴昻잌뇤n}矮℡떉鴨뚪깴昻잌뇤n{^ 1110100011100001101000101110010110001011100111111110010011100101100011001110100110000011101000101110010011101001100111111110010110000111100011000110111001111101111010001110000110100010111001011000101110011111111001001110010110001100111010011000001110100010111001001110100110011111111001011000011110001100011011100111101101011110 e8e1a2e58b9fe4e58ce983a2e4e99fe5878c6e7de8e1a2e58b9fe4e58ce983a2e4e99fe5878c6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)