To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???茵??循??n}???茵??循??n{^ 001111110011111100111111111001001001111100111111001111111000111101111010001111110011111101101110011111010011111100111111001111111110010010011111001111110011111110001111011110100011111100111111011011100111101101011110 3f3f3fe49f3f3f8f7a3f3f6e7d3f3f3fe49f3f3f8f7a3f3f6e7b5e
EUC-JP ???茵??循??n}???茵??循??n{^ 001111110011111100111111111010001010000100111111001111111011110111011011001111110011111101101110011111010011111100111111001111111110100010100001001111110011111110111101110110110011111100111111011011100111101101011110 3f3f3fe8a13f3fbddb3f3f6e7d3f3f3fe8a13f3fbddb3f3f6e7b5e
UTF-8 麗멥굦茵먩갭循뚰겭n}麗멥굦茵먩갭循뚰겭n{^ 1110111110100110100010001110101110101001101001011110101010110101101001101110100010001100101101011110101110101000101010011110101010110000101011011110010110111110101010101110101110011010101100001110101010110010101011010110111001111101111011111010011010001000111010111010100110100101111010101011010110100110111010001000110010110101111010111010100010101001111010101011000010101101111001011011111010101010111010111001101010110000111010101011001010101101011011100111101101011110 efa688eba9a5eab5a6e88cb5eba8a9eab0ade5beaaeb9ab0eab2ad6e7defa688eba9a5eab5a6e88cb5eba8a9eab0ade5beaaeb9ab0eab2ad6e7b5e
UHC 麗멥굦茵먩갭循뚰겭n}麗멥굦茵먩갭循뚰겭n{^ 1110011010110000101110001110001110000010100011001110110011100000100100001110011010110000101110001110001011100000100011001110110110000001101110110110111001111101111001101011000010111000111000111000001010001100111011001110000010010000111001101011000010111000111000101110000010001100111011011000000110111011011011100111101101011110 e6b0b8e3828cece090e6b0b8e2e08ced81bb6e7de6b0b8e3828cece090e6b0b8e2e08ced81bb6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)