To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 欲??泣?????n}欲??泣?????n{^ 100101110111111000111111001111111000101110000011001111110011111100111111001111110011111101101110011111011001011101111110001111110011111110001011100000110011111100111111001111110011111100111111011011100111101101011110 977e3f3f8b833f3f3f3f3f6e7d977e3f3f8b833f3f3f3f3f6e7b5e
EUC-JP 欲??泣?????n}欲??泣?????n{^ 110011011101111100111111001111111011010111100011001111110011111100111111001111110011111101101110011111011100110111011111001111110011111110110101111000110011111100111111001111110011111100111111011011100111101101011110 cddf3f3fb5e33f3f3f3f3f6e7dcddf3f3fb5e33f3f3f3f3f6e7b5e
UTF-8 欲뀁뼲泣앯꽣紐뚯댋n}欲뀁뼲泣앯꽣紐뚯댋n{^ 1110011010101100101100101110101110000000100000011110101110111100101100101110011010110011101000111110110010010101101011111110101010111101101000111110111110100111100011111110101110011010101011111110101110001100100010110110111001111101111001101010110010110010111010111000000010000001111010111011110010110010111001101011001110100011111011001001010110101111111010101011110110100011111011111010011110001111111010111001101010101111111010111000110010001011011011100111101101011110 e6acb2eb8081ebbcb2e6b3a3ec95afeabda3efa78feb9aafeb8c8b6e7de6acb2eb8081ebbcb2e6b3a3ec95afeabda3efa78feb9aafeb8c8b6e7b5e
UHC 欲뀁뼲泣앯꽣紐뚯댋n}欲뀁뼲泣앯꽣紐뚯댋n{^ 1110100110110000101100101110110010010110101101011110101111101000100111011110011110000100101100001110101110101010100011001110110010001000101101000110111001111101111010011011000010110010111011001001011010110101111010111110100010011101111001111000010010110000111010111010101010001100111011001000100010110100011011100111101101011110 e9b0b2ec96b5ebe89de784b0ebaa8cec88b46e7de9b0b2ec96b5ebe89de784b0ebaa8cec88b46e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)