To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??韋??循??冗??逾??巡?????異 1000110011101011001111110011111111101000111010000011111100111111100011110111101000111111001111111000111111100111001111110011111111100111101001010011111100111111100011111000010000111111001111110011111100111111001111111000100011011001 8ceb3f3fe8e83f3f8f7a3f3f8fe73f3fe7a53f3f8f843f3f3f3f3f88d9
EUC-JP 誤??韋??循??冗??逾??巡?????異 1011100011101101001111110011111111110000111010100011111100111111101111011101101100111111001111111011111011101001001111110011111111101110101001110011111100111111101111011110010000111111001111110011111100111111001111111011000011011011 b8ed3f3ff0ea3f3fbddb3f3fbee93f3feea73f3fbde43f3f3f3f3fb0db
UTF-8 誤곸룆韋귝쨫循녿겱冗밸맍逾뷴슖巡볥걙亮쇰뎿異 111010001010101010100100111010101011001110111000111010111010001110000110111010011001111110001011111010101011011110011101111011001010100010101011111001011011111010101010111010111000010110111111111010101011001010110001111001011000011010010111111010111011000010111000111010111010011110001101111010011000000010111110111010111011011110110100111011001000101010010110111001011011011110100001111010111011001110100101111010101011000110011001111011111010010110110111111011001000011110110000111010111000111010111111111001111001010110110000 e8aaa4eab3b8eba386e99f8beab79deca8abe5beaaeb85bfeab2b1e58697ebb0b8eba78de980beebb7b4ec8a96e5b7a1ebb3a5eab199efa5b7ec87b0eb8ebfe795b0
UHC 誤곸룆韋귝쨫循녿겱冗밸맍逾뷴슖巡볥걙亮쇰뎿異 1110100010100110100000011110110010001111100001011110101011011111100000101110011010100100100001011110001011100000100001101110101110000001101111011110100110110111101110011110101110010000101001001110101110110101101110101110010110011010101001011110001011011110100100111110101110000001100000111110010110111001101111001110101110001001100100101110110010110110 e8a681ec8f85eadf82e6a485e2e086eb81bde9b7b9eb90a4ebb5bae59aa5e2de93eb8183e5b9bceb8992ecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)