What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 俑?????饒??n}俑?????饒??n{^ 100110001101101000111111001111110011111100111111001111111110100101100000001111110011111101101110011111011001100011011010001111110011111100111111001111110011111111101001011000000011111100111111011011100111101101011110 98da3f3f3f3f3fe9603f3f6e7d98da3f3f3f3f3fe9603f3f6e7b5e
EUC-JP 俑?????饒??n}俑?????饒??n{^ 110100001101110000111111001111110011111100111111001111111111000111000001001111110011111101101110011111011101000011011100001111110011111100111111001111110011111111110001110000010011111100111111011011100111101101011110 d0dc3f3f3f3f3ff1c13f3f6e7dd0dc3f3f3f3f3ff1c13f3f6e7b5e
UTF-8 俑겼쳣曆꿩펵饒삣츒n}俑겼쳣曆꿩펵饒삣츒n{^ 1110010010111111100100011110101010110010101111001110110010110011101000111110111110100110100010111110101010111111101010011110110110001110101101011110100110100101100100101110110010000010101000111110110010111000100100100110111001111101111001001011111110010001111010101011001010111100111011001011001110100011111011111010011010001011111010101011111110101001111011011000111010110101111010011010010110010010111011001000001010100011111011001011100010010010011011100111101101011110 e4bf91eab2bcecb3a3efa68beabfa9ed8eb5e9a592ec82a3ecb8926e7de4bf91eab2bcecb3a3efa68beabfa9ed8eb5e9a592ec82a3ecb8926e7b5e
UHC 俑겼쳣曆꿩펵饒삣츒n}俑겼쳣曆꿩펵饒삣츒n{^ 1110100110110101101100001110010110101011100010011110011010110111101100101110011010111100100001101110100110101110101110111110010110101110100011010110111001111101111010011011010110110000111001011010101110001001111001101011011110110010111001101011110010000110111010011010111010111011111001011010111010001101011011100111101101011110 e9b5b0e5ab89e6b7b2e6bc86e9aebbe5ae8d6e7de9b5b0e5ab89e6b7b2e6bc86e9aebbe5ae8d6e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
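
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("n{^").items():
        print(label, bits, hex_string)
```

For pure ASCII input such as `n{^`, every charset in the table yields the same bytes, 6e 7b 5e, which are the last three bytes of each row above.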