To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 上ユシャ丈ヲ疾自n}上ユシャ丈ヲ疾自n{^ 100011111110001111010101101111001010110010001111111001001010011010001110101111101111001011000110100011101010100101101110011111011000111111100011110101011011110010101100100011111110010010100110100011101011111011110010110001101000111010101001011011100111101101011110 8fe3d5bcac8fe4a68ebef2c68ea96e7d8fe3d5bcac8fe4a68ebef2c68ea96e7b5e
EUC-JP 上ユシャ丈ヲ疾?自n}上ユシャ丈ヲ疾?自n{^ 101111101110010110001110110101011000111010111100100011101010110010111110111001101000111010100110101111001100000000111111101111001010101101101110011111011011111011100101100011101101010110001110101111001000111010101100101111101110011010001110101001101011110011000000001111111011110010101011011011100111101101011110 bee58ed58ebc8eacbee68ea6bcc03fbcab6e7dbee58ed58ebc8eacbee68ea6bcc03fbcab6e7b5e
UTF-8 上ユシャ丈ヲ疾自n}上ユシャ丈ヲ疾自n{^ 1110010010111000100010101110111110111110100101011110111110111101101111001110111110111101101011001110010010111000100010001110111110111101101001101110011110010110101111101110111010000111101111011110100010000111101010100110111001111101111001001011100010001010111011111011111010010101111011111011110110111100111011111011110110101100111001001011100010001000111011111011110110100110111001111001011010111110111011101000011110111101111010001000011110101010011011100111101101011110 e4b88aefbe95efbdbcefbdace4b888efbda6e796beee87bde887aa6e7de4b88aefbe95efbdbcefbdace4b888efbda6e796beee87bde887aa6e7b5e
UHC 上???丈?疾?自n}上???丈?疾?自n{^ 11011111101111100011111100111111001111111110110111011011001111111111001011110000001111111110110110111011011011100111110111011111101111100011111100111111001111111110110111011011001111111111001011110000001111111110110110111011011011100111101101011110 dfbe3f3f3feddb3ff2f03fedbb6e7ddfbe3f3f3feddb3ff2f03fedbb6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)