To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 猥??腰??訝?┃N}猥??腰??訝?┃N{^ 11100000110011100011111100111111100011011001100000111111001111111110011001100010001111111000010010101011010011100111110111100000110011100011111100111111100011011001100000111111001111111110011001100010001111111000010010101011010011100111101101011110 e0ce3f3f8d983f3fe6623f84ab4e7de0ce3f3f8d983f3fe6623f84ab4e7b5e
EUC-JP 猥??腰??訝?┃N}猥??腰??訝?┃N{^ 11100000110100000011111100111111101110011111100000111111001111111110101111000011001111111010100010101101010011100111110111100000110100000011111100111111101110011111100000111111001111111110101111000011001111111010100010101101010011100111101101011110 e0d03f3fb9f83f3febc33fa8ad4e7de0d03f3fb9f83f3febc33fa8ad4e7b5e
UTF-8 猥덁툑腰쇤뇠訝덆┃N}猥덁툑腰쇤뇠訝덆┃N{^ 1110011110001100101001011110101110001101100000011110110110001000100100011110100010000101101100001110110010000111101001001110101110000111101000001110100010101000100111011110101110001101100001101110001010010100100000110100111001111101111001111000110010100101111010111000110110000001111011011000100010010001111010001000010110110000111011001000011110100100111010111000011110100000111010001010100010011101111010111000110110000110111000101001010010000011010011100111101101011110 e78ca5eb8d81ed8891e885b0ec87a4eb87a0e8a89deb8d86e294834e7de78ca5eb8d81ed8891e885b0ec87a4eb87a0e8a89deb8d86e294834e7b5e
UHC 猥덁툑腰쇤뇠訝덆┃N}猥덁툑腰쇤뇠訝덆┃N{^ 1110100011100101100010001110010010111000100010001110100110100110101111001110100110000111100010001110010010111000100010001110100110100110101011010100111001111101111010001110010110001000111001001011100010001000111010011010011010111100111010011000011110001000111001001011100010001000111010011010011010101101010011100111101101011110 e8e588e4b888e9a6bce98788e4b888e9a6ad4e7de8e588e4b888e9a6bce98788e4b888e9a6ad4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)