To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z}?????????z{^ 0011111100111111001111110011111100111111001111110011111100111111001111110111101001111101001111110011111100111111001111110011111100111111001111110011111100111111011110100111101101011110 3f3f3f3f3f3f3f3f3f7a7d3f3f3f3f3f3f3f3f3f7a7b5e
SJIS-WIN ???????ら?z}???????ら?z{^ 00111111001111110011111100111111001111110011111100111111100000101110011100111111011110100111110100111111001111110011111100111111001111110011111100111111100000101110011100111111011110100111101101011110 3f3f3f3f3f3f3f82e73f7a7d3f3f3f3f3f3f3f82e73f7a7b5e
EUC-JP 渶??????ら?z}渶??????ら?z{^ 1000111111000111111011010011111100111111001111110011111100111111001111111010010011101001001111110111101001111101100011111100011111101101001111110011111100111111001111110011111100111111101001001110100100111111011110100111101101011110 8fc7ed3f3f3f3f3f3fa4e93f7a7d8fc7ed3f3f3f3f3f3fa4e93f7a7b5e
UTF-8 渶싮떎僚묌퍖獵ら쪓z}渶싮떎僚묌퍖獵ら쪓z{^ 1110011010111000101101101110110010001011101011101110101110010110100011101110111110100110101110111110101110101100100011001110110110001101100101101110111110100110101001111110001110000010100010011110110010101010100100110111101001111101111001101011100010110110111011001000101110101110111010111001011010001110111011111010011010111011111010111010110010001100111011011000110110010110111011111010011010100111111000111000001010001001111011001010101010010011011110100111101101011110 e6b8b6ec8baeeb968eefa6bbebac8ced8d96efa6a7e38289ecaa937a7de6b8b6ec8baeeb968eefa6bbebac8ced8d96efa6a7e38289ecaa937a7b5e
UHC 渶싮떎僚묌퍖獵ら쪓z}渶싮떎僚묌퍖獵ら쪓z{^ 1110011110110111100110101110100110001011101001001110100011101000100100011110100110111011100011011110011110100110101010101110100110100101100011010111101001111101111001111011011110011010111010011000101110100100111010001110100010010001111010011011101110001101111001111010011010101010111010011010010110001101011110100111101101011110 e7b79ae98ba4e8e891e9bb8de7a6aae9a58d7a7de7b79ae98ba4e8e891e9bb8de7a6aae9a58d7a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)