What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蜈???ヨ?哀??n}蜈???ヨ?哀??n{^ 1110010110000101001111110011111100111111100000111000100000111111100010001010001100111111001111110110111001111101111001011000010100111111001111110011111110000011100010000011111110001000101000110011111100111111011011100111101101011110 e5853f3f3f83883f88a33f3f6e7de5853f3f3f83883f88a33f3f6e7b5e
EUC-JP 蜈???ヨ?哀??n}蜈???ヨ?哀??n{^ 1110100111100101001111110011111100111111101001011110100000111111101100001010010100111111001111110110111001111101111010011110010100111111001111110011111110100101111010000011111110110000101001010011111100111111011011100111101101011110 e9e53f3f3fa5e83fb0a53f3f6e7de9e53f3f3fa5e83fb0a53f3f6e7b5e
UTF-8 蜈곫녂略ヨ뮸哀앲꼨n}蜈곫녂略ヨ뮸哀앲꼨n{^ 1110100010011100100010001110101010110011101010111110101110000101100000101110111110100101101101101110001110000011101010001110101110101110101110001110010110010011100000001110110010010101101100101110101010111100101010000110111001111101111010001001110010001000111010101011001110101011111010111000010110000010111011111010010110110110111000111000001110101000111010111010111010111000111001011001001110000000111011001001010110110010111010101011110010101000011011100111101101011110 e89c88eab3abeb8582efa5b6e383a8ebaeb8e59380ec95b2eabca86e7de89c88eab3abeb8582efa5b6e383a8ebaeb8e59380ec95b2eabca86e7b5e
UHC 蜈곫녂略ヨ뮸哀앲꼨n}蜈곫녂略ヨ뮸哀앲꼨n{^ 1110100010100101100000011110011010000110101110101110010110110010101010111110100010010010101111111110010011101110100111011110100010000100100001010110111001111101111010001010010110000001111001101000011010111010111001011011001010101011111010001001001010111111111001001110111010011101111010001000010010000101011011100111101101011110 e8a581e686bae5b2abe892bfe4ee9de884856e7de8a581e686bae5b2abe892bfe4ee9de884856e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
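
The kind of conversion shown in the table can be sketched in Python as follows. This is only an illustration, not the tool's actual implementation: the mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the choice to replace unencodable characters with "?" (0x3f), and the helper name to_bit_strings is hypothetical.

```python
# Assumed mapping from the charset labels above to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec
    # cannot represent; this is an assumed policy for the sketch.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("n{^", codec)
        print(label, binary, hexadecimal)
```

For pure ASCII input such as "n{^", all five codecs should produce the same bytes (6e 7b 5e); the charsets only diverge once the input contains non-ASCII characters.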