To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???節??侑??}???節??侑??{^ 00111111001111110011111110010000110111110011111100111111100110001101000000111111001111110111110100111111001111110011111110010000110111110011111100111111100110001101000000111111001111110111101101011110 3f3f3f90df3f3f98d03f3f7d3f3f3f90df3f3f98d03f3f7b5e
EUC-JP ???節??侑??}???節??侑??{^ 00111111001111110011111111000000111000010011111100111111110100001101001000111111001111110111110100111111001111110011111111000000111000010011111100111111110100001101001000111111001111110111101101011110 3f3f3fc0e13f3fd0d23f3f7d3f3f3fc0e13f3fd0d23f3f7b5e
UTF-8 樂뀀쑚節꾨쨱侑ㅽ릨}樂뀀쑚節꾨쨱侑ㅽ릨{^ 111011111010011010111111111010111000000010000000111011001001000110011010111001111010111110000000111010101011111010101000111011001010100010110001111001001011111010010001111000111000010110111101111010111010011010101000011111011110111110100110101111111110101110000000100000001110110010010001100110101110011110101111100000001110101010111110101010001110110010101000101100011110010010111110100100011110001110000101101111011110101110100110101010000111101101011110 efa6bfeb8080ec919ae7af80eabea8eca8b1e4be91e385bdeba6a87defa6bfeb8080ec919ae7af80eabea8eca8b1e4be91e385bdeba6a87b5e
UHC 樂뀀쑚節꾨쨱侑ㅽ릨}樂뀀쑚節꾨쨱侑ㅽ릨{^ 111010001111100110110010111010111001110010111001111011111011110110000100111010111010010010001011111010101110001010100100111011011001000010001010011111011110100011111001101100101110101110011100101110011110111110111101100001001110101110100100100010111110101011100010101001001110110110010000100010100111101101011110 e8f9b2eb9cb9efbd84eba48beae2a4ed908a7de8f9b2eb9cb9efbd84eba48beae2a4ed908a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)