To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 汝??耶??蘊??n}汝??耶??蘊??n{^ 1001001111110000001111110011111110010110111010110011111100111111111001010101110100111111001111110110111001111101100100111111000000111111001111111001011011101011001111110011111111100101010111010011111100111111011011100111101101011110 93f03f3f96eb3f3fe55d3f3f6e7d93f03f3f96eb3f3fe55d3f3f6e7b5e
EUC-JP 汝??耶??蘊??n}汝??耶??蘊??n{^ 1100011011110010001111110011111111001100111011010011111100111111111010011011111000111111001111110110111001111101110001101111001000111111001111111100110011101101001111110011111111101001101111100011111100111111011011100111101101011110 c6f23f3fcced3f3fe9be3f3f6e7dc6f23f3fcced3f3fe9be3f3f6e7b5e
UTF-8 汝싨㉬耶섉㉬蘊딀튆n}汝싨㉬耶섉㉬蘊딀튆n{^ 1110011010110001100111011110110010001011101010001110001110001001101011001110100010000000101101101110110010000100100010011110001110001001101011001110100010011000100010101110101110010100100000001110110110001010100001100110111001111101111001101011000110011101111011001000101110101000111000111000100110101100111010001000000010110110111011001000010010001001111000111000100110101100111010001001100010001010111010111001010010000000111011011000101010000110011011100111101101011110 e6b19dec8ba8e389ace880b6ec8489e389ace8988aeb9480ed8a866e7de6b19dec8ba8e389ace880b6ec8489e389ace8988aeb9480ed8a866e7b5e
UHC 汝싨㉬耶섉㉬蘊딀튆n}汝싨㉬耶섉㉬蘊딀튆n{^ 1110011010100011100110101110011010101000101111011110010110101101100110001110011010101000101111011110100010110011100010101110011010111001100110110110111001111101111001101010001110011010111001101010100010111101111001011010110110011000111001101010100010111101111010001011001110001010111001101011100110011011011011100111101101011110 e6a39ae6a8bde5ad98e6a8bde8b38ae6b99b6e7de6a39ae6a8bde5ad98e6a8bde8b38ae6b99b6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)