To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???爾←??P←?喩?????爾←?喩? 0011111100111111001111111000111010100010100000011010100100111111001111111000001001101111100000011010100100111111100110100110011100111111001111110011111100111111001111111000111010100010100000011010100100111111100110100110011100111111 3f3f3f8ea281a93f3f826f81a93f9a673f3f3f3f3f8ea281a93f9a673f
EUC-JP ???爾←?洹P←?喩?????爾←?喩? 00111111001111110011111110111100101001001010001010101011001111111000111111000111101110101010001111010000101000101010101100111111110100111100100000111111001111110011111100111111001111111011110010100100101000101010101100111111110100111100100000111111 3f3f3fbca4a2ab3f8fc7baa3d0a2ab3fd3c83f3f3f3f3fbca4a2ab3fd3c83f
UTF-8 銳얜끂爾←땟洹P←땟喩뽰낍銳얜끂爾←땟喩뽓 111010011000101010110011111011001001011010011100111010111000000110000010111001111000100010111110111000101000011010010000111010111001010110011111111001101011010010111001111011111011110010110000111000101000011010010000111010111001010110011111111001011001011010101001111010111011110110110000111010111000001010001101111010011000101010110011111011001001011010011100111010111000000110000010111001111000100010111110111000101000011010010000111010111001010110011111111001011001011010101001111010111011110110010011 e98ab3ec969ceb8182e788bee28690eb959fe6b4b9efbcb0e28690eb959fe596a9ebbdb0eb828de98ab3ec969ceb8182e788bee28690eb959fe596a9ebbd93
UHC 銳얜끂爾←땟洹P←땟喩뽰낍銳얜끂爾←땟喩뽓 111001111110010110111110111010111000010110111000111011001011001110100001111001111011011010101101111010101011011110100011110100001010000111100111101101101010110111101010111001111001011011101100101100111010011111100111111001011011111011101011100001011011100011101100101100111010000111100111101101101010110111101010111001111001011011010000 e7e5beeb85b8ecb3a1e7b6adeab7a3d0a1e7b6adeae796ecb3a7e7e5beeb85b8ecb3a1e7b6adeae796d0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)